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Chapter  1 


Introduction 


In  this  thesis  we  study  the  significance  levels  and  powers  of  repeated  significance  test 
(RST).  A  P.ST  is  the  sequential  version  of  the  generalized  livelihood  ratio  test.  To  be  more 
precise,  let  us  consider  the  following  testing  problems  Let  X\,X2 ,  •••  be  i.i.d.  according  to 
the  distribution  function  Ft  where  {/#}  form  a  multiparameter  exponential  family.  By  that 
we  mean  Ft  has  the  form  F$(dx)  =  Fo{dx)  for  some  smooth  function  V’(-)  from  the 

parameter  space  9  into  i?  and  some  distribution  function  F0  over  Rd.  It  is  well  known  that 
EfXi  —  p(9)  =  V\i(0).  Moreover,  there  is  no  loss  of  generality  to  assume  that  p(0)  =  0. 
Sometimes  it  is  convenient  to  index  this  family  by  p  and  write  Fp.  Let  0O  be  a  proper  subset 
of  0  C  R?.  If  we  want  to  test  Ho  :  0  €  0o  against  Hi  :  0  £  0o-  The  generalized  log  likelihood 
ratio  statistic  after  observing  Xit  X2,  •  •  • ,  Xn  for  this  testing  problem  is 

r»A(S„/n)  =  sup4>(0)  -  sup  tn(0)  =  n^(S„/n)  -  n<f>o{S„/n) 
tee  *ee0 

where  tn(9)  =  9'Sn  -  nif>(9)  is  the  log  likelihood  after  observing  Xi,-  •  • ,  X„,  S„  = 
and 

<fi(x)  =  sup \?x  -  4>(*)b  4>o(*)  -  8«P  \*x  ~  W)h 

tee  tee  0 

The  RST  is  defined  in  terms  of  the  following  stopping  rule 

T  =  inf {n  >  mo,  r»A(S„/n)  >  a}. 

It  stops  sampling  at  T  Am  and  rejects  Ho  when  T  <  m.  The  significance  level  and  power  of 
the  RST  are  given  by 

max  P,{T  <  to) 
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and  Pt{T  <  m}  6  $  Qq  where  P$  denotes  the  probability  law  under  which  Xi,X*,  •  •  •  are  i.i.d. 
according  to  the  distribution  function  F$.  In  some  cases  when  one  expects  a  small  deviation 
from  the  null  hypothesis  and  wants  to  increase  the  power,  one  may  use  a  modified  version  of 
the  RST.  The  MRST  rejects  H0  when  either  T  <  m  or  T  >  m  and  mA(Sm/m)  >  c  for  some 
c  <  a. 

Observe  that  when  we  fix  e  and  let  a  tend  to  oo  then  it  is  unlikely  that  the  log  likelihood 
ratio  process  nA(S„/n)  will  cross  the  level  a  before  time  m  and  the  rejection  region  of  the 
corresponding  MRST  reduces  essentially  to  {mA (S,n/m)  >  e}  which  is  exactly  the  rejection 
region  of  a  fixed  sample  test.  On  the  other  hand  if  we  set  a  =  c  then  the  corresponding 
MRST  is  just  RST.  So  the  MRST  can  be  thought  of  as  a  family  of  tests  interpolating  the 
fixed  sample  size  test  and  the  RST. 

Underlying  this  interpolation,  there  is  a  trade-off  between  expected  sample  size  and 
power,  that  is,  as  a  moves  from  e  to  oo  the  power  of  MRST  increases  to  that  of  a  fixed  sample 
test  at  the  cost  of  increasing  the  expected  sample  size.  So  with  MRST  at  hand  the  designer 
of  an  experiment  has  one  more  degree  of  freedom  to  choose  in  fulfilling  his  needs.  If  he  thinks 
power  is  more  important  he  may  choose  a  MRST  with  a  substantially  larger  than  c.  If  smaller 
expected  sample  size  is  desired  he  may  choose  a  close  to  e. 

The  power  of  the  MRST  is  given  by 

Pt{T  <  r<i}  +  Pt{T  >  to,  mA[Sm/m)  >  c } 

(1) 

=  P<{toA(Sto/to)  >  c}  +  Pt{T  <  to,  rnA(Sm/rn)  <  c) 

The  quantity  (I)  also  appears  on  other  occasions.  Siegmund  (I9S5)  suggests  defining  the 
attained  significance  levels  for  a  RST  as  follows: 

(i)  If  T  -  mo  and  moA(Smo/mo)  =  z  >  a  then  the  attained  level  is  aup6o  P#{tooA(SWo/too) 

>*}• 

(ii)  If  T  =  n  €  (too,  to]  the  attained  level  is  supeo  P{T  <  n}. 
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(iii)  If  T  >  m  and  mA(5m/rn)  =  c  then  the  attained  level  is 

sup[P(j{T  <  m}  +  P${T  >  m,  mk{Sm/m)  >  e}] 

©o 

=  sup[Ps{rnA(5m/m)  >  e}  +  Pt{T  <  m,  mA(5ro/m)  <  e}]. 
e0 

In  case  (iii)  above  the  attained  significance  level  is  of  the  same  form  as  (1). 

In  this  thesis  we  only  consider  a  special  kind  of  ©o,  e.g. 

©o  —  {0  \  6 1  *  —  0}  di  <  d 

By  reparametrization  ©o  can  be  generalized  to  ©J,  =  {AO  :  0  6  ©}  where  A  is  a  d  x  d  matrix. 
Typically  the  significance  level  and  power  of  the  RST  and  MRST  cannot  be  computed  exactly 
and  some  sort  of  approximation  is  required.  Approximations  for  significance  levels  of  the 
RST  in  exponential  families  have  been  provided  by  Woodroofe  (1978)  and  Lalley  (1983). 
Their  setting  is  more  general  than  that  above,  but  their  methods  are  not  as  successful  in 
approximating  the  power  of  the  RST  and  to  power  and  significance  level  of  the  MRST.  In 
what  follows  we  shall  exhibit  with  a  simple  example  three  methods  which  have  been  developed 
by  previous  authors.  Let  Xj ,  X2 ,  ••  •  be  i.i.d.  according  to  N(0, 1).  We  want  to  test  0  =  0 
against  0^0.  The  RST  in  this  case  is  defined  by  the  stopping  rule 

T  ss  inf  {n  >  mo,  S*/(2r»)  >  o}  . 

1.1.  Forward  Method. 

The  essential  ingredients  of  this  method  are  the  likelihood  ratio  of  a  mixture  measure 
Q  and  the  probability  measure  Po  under  the  null  hypothesis,  and  the  Wald  likelihood  ratio 
identity.  Let  Q(A)  =  J^ooP,{A)d0  then 

Here  the  notation  %{Y)  means  that  fi  and  v  are  considered  to  be  measures  on  the  <r-field 
generated  by  Y,  and  jfc(Y)  is  the  Radon-Nikodym  derivative  of  the  restricted  measures. 
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By  Wald’s  likelihood  ratio  identity 

Po{T  <  m}  =  EQ{{T/2ir)l^e-sT^T;  T  <  m> 

=  r  E,{(T/2r)l'2ezqp\-{a+  fl,„(r))];  T  <  m}d$ 

J — OO 

=  {a/x)l^e~a  n  E9  |(T/2a)1^2e-Km^;  T  <  m}  d$ 

where  Rm(T)  =  (S|,/2T  —  a)  is  the  corresponding  excess  over  the  boundary  for  this  problem. 

Before  we  go  any  further,  let  me  introduce  some  notation.  Throughout  this  work  I  use 
R(T)  to  denote  the  excess  over  the  boundary  corresponding  to  the  stopping  time  T.  Usually 
the  stopping  time  depends  on  a  scale  parameter  m.  To  emphasize  the  dependence  on  m  I 
also  write  Rm(T)  or  Rm  if  it  does  not  cause  confusion,  and  Roo{T)  or  Rx,  the  corresponding 
limit  in  distribution  as  to  — ♦  oo. 

If  a,  m,  m0  tend  to  oo  in  such  a  way  that  (2 a/m)1/2  =  8t  <  80  =  (2a/mo)1^2  then  an 
argument  using  the  strong  law  of  large  numbers  shows  that  with  P^-probability  one 

(T/2a)1^2l(mo<j'<TO)  — *•  0~l  1|, „,„,(*) 

and 

Po{T  <  to}  ~  (o/jr)l/2e-“  /  r%{e-R~<T>}d0 

.  Now  Et(e~Ro°^)  can  be  approximated  using  nonlinear  renewal  theory  developed  by  Lai 
and  Siegmund  (1977,  1979)  or  Woodroofe  (1976a)  and  the  approximation  is  completed. 

This  method  has  been  generalized  to  RSTs  for  curved  exponential  families  by  Lalley 
(1983). 

1.2.  The  Backward  Method. 
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The  backward  method  which  is  due  to  Siegmund  (1985)  sets  its  primary  goal  on  approx¬ 
imating  the  conditional  probability  P^(A)  =  P{A  |  5m  =  (},  which  by  sufficiency  of  Sm  is 
independent  of  8.  Then  the  power  and  significance  level  may  be  obtained  by  uncouditioning 
with  respect  to  the  distribution  of  Sm.  In  Chapter  3,4  we  shall  generalize  this  method  to 
multiparameter  exponential  families. 
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The  essence  of  this  method  involves  randomising  the  starting  point  of  a  process,  then 
treating  it  like  a  process  running  backward  from  the  point  of  conditioning.  Let  P^(A)  = 
P(A  |  So  —  A,  Sm  -  £)  and  T*  =  sup  {n  <  m,  S*/2n  >  o}.  Observe  that  Pq”'(T  <  m)  = 
P$(T*  >  m0).  Let 

P^'M)  =  I"  PiJ(A)(2Tm)-l/2exp{-[(A-  0V2m]}dA 
J  —OO 

Then 

, , _ < » \v*  /si _  e\ 

dPi’"/S”'  'S'"  ~  '■»'  P\!"  2m/ ' 

Since  under  the  reversed  time  scale  T*  is  a  stopping  time,  Wald’s  likelihood  ratio  identity 


f'-V  >  ».)  =  *}“»  { (£)*'%  (^-|);r>».}. 

The  Pj”*^  distribution  of  S„,  n  —  m,  m  -  1,  •  •  •  running  backward  from  Sm  —  £  is  the  same  as 
the  P0  distribution  of  £  -  S„,  n  =  0, 1,  •  •  •  running  forward. 

Hence  the  expectation  above  equals 

E‘  { (snb) ' [& '  ^7y] ;  r  s  - - "• } 

where  r  =  inf{n  >1,  ({  +  Sn)2/[2(m  -  n)]  >  o}.  Assume  that  =  (2 o/mj)1/2  and 
£o  =  m-1£.  A  law  of  large  numbers  argument  shows  that  r/m  — *  1  — (fo/^i)2  with  probability 
one  asm-too.  The  quantity  above  is  approximated  by 

(m^i/f)exp(-a  +  £2/2m)Po{e-I*“(r)} 


v/here 


JUr)  =  {(Sr-fl7(2(m-r)]}-a 


is  the  excess  over  the  boundary  at  the  stopping  time  r.  Again  nonlinear  renewal  theory  can 
be  used  to  obtain  the  asymptotic  distribution  of  Rm{r)  and  the  approximation  for  P^(T  < 
m)  is  completed.  Unconditioning  f  using  the  marginal  distribution  of  S„  under  Po  yields 
p0(r  <  m). 
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We  may  uncondition  £  using  P$  with  8^0  and  obtain  an  approximation  to  the  powers. 
Unfortunately  the  result  is  not  a  bona  fide  asymptotic  expression,  although  the  numerical 
results  show  that  it  is  a  very  good  approximation.  See  Siegmund  (1985)  Section  9.3  for 
details. 

1.3.  Woodroofe’s  Method. 

This  method,  which  was  developed  by  Woodroofe,  is  quite  different  from  the  two  meth¬ 
ods  described  above.  It  does  not  use  Wald’s  likelihood  ratio  identity.  The  method  first 
approximates  Pq{T  =  n}  then  estimates  Po{T  <  m}  by  summation.  Observe  that  Pq{T  = 
m}  ~  2P{r+  =  n}  where  T+  =  inf{n  >  mo,  S„  >  y/2 na} 

P0{T+  =  n}  =  f*  P<n,(T+  >  n  -  l)(2xn)-1/2  •  exp(-£2/2 n)df . 

Jy/lm 

It  is  easy  to  see  that  the  only  values  of  {  which  are  of  first  order  importance  are  \/2an  +  0(l). 
In  this  range  we  can  approximate  the  curve  y/2na  by  its  tangent  and  the  conditional  random 
walk  by  an  unconditional  one  (with  drift  (o).  That  is,  let  £  =  \j2an  +  y  where  y  is  arbitrary 
but  fixed. 

Pi"){T+  >  n  —  1}  —  Po{-?i  <  V2ak  for  all  mo  <  k  <  n  —  1  |  Sn  =  y/2an  +  y} 

=  Po{5n  -  St  >  y  +  \Z2a[nx^3  -  Jfc1/2)  for  all  mo  <  k  <  n  |  Sn  =  \flan  +  y } 

=  P0{S,-  >  y  +  \/2a(n1/3  —  (n  -  i)1/2)  for  all  1  <  *  <  n  -  m0  |  —  =  p*  +  o(l)}. 

n 

Observe  that  y/2n[nlf7  —  (n  —  t)1/2]  =  on-1/2t  +  0( '/2an_1/2n-s/2t2 )  — ♦  \p*i  if  n  and 

a  tend  to  infinity  in  such  a  way  that  (2o/n)1/f2  — »  .  The  conditional  probability  above  is 
asymptotically  equivalent  to 

Pp.{5i  >  y  +  ^p*i  for  all »  >  1}  =  P^/z{Si  >  y  for  all  i  >  1} 

To  continue  we  need 

Lemma  1.  (Woodroofe,  1982,  p.  .)  Assume  p  =  EXx  >  0.  Let  M  =  min(5i,5j,  -  •). 
Then  for  x  >  0 


(£?(Jf+)]-1P{Sf+  >  x)  =  p~lP{M  >  a)  where  r+  =  inf{n  >  1,  S„  >  0} 
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Proof:  Define  a  =  sup{n  :  S„  =  M}  and  r_  =  inf{n  >  1,  S„  <  0}.  Consider  the  probability 
P{a  =  n,  M  >  x).  By  restarting  the  random  walk  at  time  n,  we  find  that 

P{a  =  n,  Af  >  x)  =  P{r_  =  oo}  ■  P{S„  <  5,-,  *  =  1,  •  • ,  n  -  1  Sn  >  x}. 

A  time  reversal  argument  shows  that 

P{Sn  <  Si  i  =  l,--,n  -  1,  S„  >  x)  =  P{5,-  <  0,  «  «  1, •••,»-  1  S„  >  *} 

-  P{r+  =  n,  5r+  >  x). 

Combizing  all  the  results  above,  we  get 

OO  PC 

P{M  >  x)  -  ^P{a  =  n,M  >  x)  -  ]TP{r_  =  oo}P{5„  <  Si  i  =  1,  •  •  •  ,n  -  1,  Sn  >  x) 

ii=i  n=l 

oo 

=  P{r_  =  «>}  £  P{r+  =  »,  Sf+  >  x)  =  (Pr+)-»P{Sr+  >  z}  =  ,i(£Sr+)-lP{Sf+  >  a:} 

n=l 

We  have  used  the  duality  relation  P{r_  =  oo}  =  (Er+)~*  and  Wald’s  identity  ESr+  =  pEr+ 
in  the  fourth  and  fifth  equality  above.  The  proof  is  completed. 

By  lemma  1  and  the  argument  given  above 
P0{T  =  n}  ~  2P0{T+  *  n} 

~  2  /  /i*[P<1*/2(5r+)]“1P#)./2{5f+  >  x}(2jcn)-1/2  •  erp{-[(2an)1/2  +  y2]/2 n}djj 
Jo 

~  »-‘(«/^)l/2«-8  riE^/iiSr^P,.,^  >  x}c-*’»dV. 

Jo 

The  integral  above  als  limo_oo  Et ,•  exp[-iZ<,(r}]  by  nonlinear  renewal  theory,  where  fto(T) 
=  S*/2T  -  a.  Summing  over  n  and  approximating  the  sum  by  an  integral  yields  the  desired 
result.  Now  we  are  in  a  position  to  make  brief  comments  on  the  three  methods  described 
above. 

If  one  were  only  concerned  with  the  significance  levels  of  the  RST  then  the  forward 
method  is  the  most  general  of  the  three.  If  one  wants  a  second  order  approximation  to  the 
significance  level  then  Woodroofe’s  method  seems  to  be  the  appropriate  method  to  use.  Since 
the  Monte  Carlo  results  in  Chapter  5  show  that  the  “obvious’’  second  order  correction  works 
quite  well,  the  complicated  second  order  correction  developed  by  Takahashi  and  Woodroofe 
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(1981,  1982)  seems  to  be  unnecessary  (at  least  in  those  cases).  But  if  we  restrict  our  attention 
to  the  linear  hypothesis,  then  the  backward  method  produces  the  most  fruitful  results,  that 
is,  it  can  be  used  to  approximate  the  significance  level,  power,  and  p-values  of  the  RST  and 
MUST.  One  of  the  major  contributions  of  this  thesis  is  to  generalize  the  backward  method  to 
multiparameter  exponential  families. 

The  rest  of  the  thesis  is  organized  as  follows.  In  Chapter  2  the  simple  null  hypothesis 
case  is  considered  and  a  theorem  which  relates  the  excess  over  the  boundary  by  the  forward 
and  backward  process  is  proved.  Chapter  3  deals  with  the  composite  hypothesis  problem. 
The  results  there  indicate  the  necessity  for  studying  the  conditional  renewal  theory  which 
is  the  topic  of  Chapter  4.  An  application  to  the  “change  point”  problem  is  also  given. 
Chapter  5  contains  a  careful  treatment  of  an  important  example:  the  repeated  t-test.  The 
numerical  approximations  of  powers  and  significance  levels  of  RST  and  MRST  and  the  results 
of  corresponding  Monte  Carlo  experiments  are  also  reported.  Finally,  the  Appendix  gives  some 
details  of  the  numerical  computation  performed  in  Chapter  5. 
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Chapter  2 


The  Simple  Null  Hypothesis  Case 


When  0o  contains  only  one  point  the  backward  method  generalizes  to  the  multiparam¬ 
eter  exponential  family  easily.  Without  loss  of  generality  we  assume  0O  =  {0}.  In  this  case 
the  stopping  rule  is  T  =  inf  {n  >  mo,  n<f>  (^)  >  tt}.  Define  T‘  =  sup  {n  <  m,  r.<j>  (^j1)  >  a} 
then  P{T  <  m)  =  P{T *  >  m0}.  Now 


dPl(  le  q  ,_fn(Sn~X)fm(t) 

d  P':'  “  A) 

U»S 

where  /„(•)  is  the  density  of  5„  under  Fq,  we  assume  f„  exist  and  satisfy  the  condition  given 
in  Proposition  1  below. 

Pi? (A)  =  P0(A  |  S0  =  A,  Sm  =  t). 

Also  define  Q(A)  =  J  pj?(.4)/m(£  —  X)dX.  The  likelihood  ratio  of  Q  with  respect  to  P^”*)  is 
easily  calculated  as 


L 


n 


jp(m) 

|i)  (^n,  •  ■  •  ,  —  X)dX 

U'l  ^ 


IM 

fn[Sn) 


The  Q  distribution  of  S„,  n  =  m,  m  —  1,  •  •  •  running  backward  from  Sm  =  mfo  is  the  same  as 
the  P0  distribution  of  m(o  -  S„,  n  =  0, 1,  ■  •  •  running  forward.  Under  the  reverse  time  scale 
T  is  a  stopping  time  so  the  Wald  likelihood  ratio  identity  gives 


J?n)(r  <  m)  =  Pim)(r  >  m0)  =  Eq  {L-l,  T*  >  m0}  =  EQ  T'  >  ™o}  • 

Let  r  =  inf{n  >  1,  (m  —  n)<j>  f  )  >  a)-  Note  that  under  Q  m—r  has  the  same  distribution 
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as  T  under  Pq  so  the  expectation  above  equals 


It  is  not  hard  to  see  that  if 


.1  r—M-s,)  i 

M- wo- ,r-  mT 


exist  and  0  <  to  <  1  -  ^  then  rjm  — »  tj0  with  probability  one,  where  a  =  mao,  £  =  m£ o, 
and  the  expectation  above  is  approximated  by 


To  continue  the  computation  we  need 

Proposition  1.  If  for  some  integer  no,  for  some  n>  no  Sn  has  a  bounded  continuous  density 
f„  with  respect  to  Lebesque  measure  on  RJ,  then  as  n  — ►  oo 

f„(nx)  ~  (2xn)-<,/2|$(s;)|-1/2  •  exp[-n^(ss)], 

where  3p(x)  =  V2^ >(d(x)),  the  covariance  matrix  of  Xy  under  hi¬ 
proof:  See  Borovkov  and  Rogozin  (1965). 

By  Proposition  1 

fm-r((-S,)  ~  [2x(m  -  r)]-rf/2  |?  exp  -(m  -  r)^^-~^J 

/*.(£)  ~  (2xm)-J/2|?(f0)rl/2e-m^o) 

30 

•  exp  fm^({o)  ~  (m  -  r)<f>  | 

=  (1  -  CCo)-rf/2|?(Co)|1/2  ■  |?  (]~^-)|  1/2 exp[-m(a0  -  *(&))]  •  E0{e~R^} 

where  Rm{r)  =  (m-  r)<f>  )  -  a  is  the  excess  over  the  boundary  at  stopping  time  r.  Let 

K^(t,  x)  -  (1  -  t)4>  ■  Theorem  2  of  Chapter  3  of  Hogan  (1984)  asserts  that  as  m  — +  oo 
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Observe  that 


=  +  (^2|  -  ■  AC  -  d2)) 


«•?> 


where  J(9^  — *  £q2' )  is  the  Jacobian  matrix  of  the  mapping  9^ 

(m  -  n)<£(2)[(£(2)  -  S^2))/m  -  n]  =  (m  -  n)4><2>(^2))  +  {n$]  -  +  o(l)  a.s. 

=  m^Ulo])  -  "leW  -  V-(2,(*f)]  +  0(1) 

Substituting  the  equation  above  into  (4.1)  completes  the  proof. 


«j(2) 


*'2).  So 


a.s.. 


Before  we  investigate  the  nonlinear  boundary  crossing  problem  let  us  consider  the  linear 
problem  first.  Suppose  we  want  to  find  the  excess  over  the  hyperplane  U  =  {x  :  7  •  x  =  e}  by 
the  d-dimensional  random  walk  Sn  in  the  norm  direction  7,  where  7  is  a  d-dimensional  vector 
satisfying  E( 7  •  Si)  =  7  ■  p.  >  0. 

The  problem  above  can  be  converted  into  a  one-dimensional  problem.  In  fact  the  first 
time  S„  crosses  H  is  the  same  as  the  first  time  the  one-dimensional  random  walk  7  •  Sn  crosses 
the  constant  level  e.  Moreover  the  excess  in  the  normal  direction  by  S„  is  exactly  (7  ■  5j-c  -  c) 
where  Te  —  inf{n,  7  ■  S„  >  c}. 


The  next  theorem  relates  the  excess  over  the  linear  boundary  of  a  conditional  random 
walk  to  that  of  an  independent  random  walk. 


Theorem  2. 

•  STc  -  e  ^  x)  =  P|0>,<»){7  ' STc-c<  x }. 

Moreover  the  equality  above  still  holds  when  c  is  allowed  to  depend  on  m  in  such  a  way  that 
0(nx1/2-'*)  =  e  -*  00  where  a  is  any  positive  number  less  than  1/2. 

Proof:  Clearly  P${Te  <  m*1"0*/2}  -»  1  as  m  — *  00.  So  for  any  given  £  >  0  we  can  find  mi 
such  that  Vm  >  mi  the  inequality  below  holds: 


L 


L. 


J 

l 

A 

,.1 


£ 

l 


■» 


* 


\P${l  •  5Tc  -  c  <  x;  Tt  <  m*1-*)'2}  -  P<${7  ■  %  -  e  <  z}|  <  £. 


Chapter  4 


Nonlinear  Renewal  Theory  for  Conditional  Random 
Walks 


Here  we  study  the  asymptotic  distribution  of  excess  over  the  boundary  by  a  conditional 
random  walk.  The  notation  used  here  is  consistent  with  that  of  Chapters  1,  2,  and  3,  except 
where  stated  otherwise.  We  begin  with  the  following  lemma  which  shows  that  as  the  time 
of  conditioning  becomes  remote  a  conditional  random  walk  behaves  like  an  independent  one 
locally.  Let  =  £<2>  •  m-1,  ^(2)(x)  =  supj(i)_0[5'x  -  ^(0)],  r/>'^(6)  =  V>[(O,0(2)].  For  the 
sake  of  simplicity  write  0^  for 


Lemma  1.  If  n  =  c/im1/2)  then 


jp(m) 

lim 

m-°o  dP(Q  9m) 


(Si  > "  ■  >  Sn) 


1  a-8'  P(o, *<’>)• 


Proof: 

dp 

7 jr^lSir-'Sn)  -  If  (SO  -f(S2  -  S0-f(Sn  -  sn_0f}*ln(t{t]  -  sP)/fW(tM)) 

dP(oKt]) 

•  [/(0^))(5l)  •  /(0, ,,>,(*  -  Si)-  ••/(„,(*>)  (^n  -  5„_i)]_1 
=  fdnU[2)  -  5<2>)  •  l/i2>U<2>)  •  exp{5<2>  •  0<2>  -  n^(i?))r. 


By  Proposition  1  of  Chapter  2  the  quantity  above  tends  to 

[m/(m  -  n)]rf»/2|?(J)(f<2))|1/2|?(2)[(e(2)  -  S<2))/(m  _  n))\~'/2 

■  cxp{-(m  -  n)^,2)[(£(2)  -  Sj?])/(m  -  n)]  (4.1) 

•  expN'^ff1)]  •  exp{-S<2>  ■  0<2)  +  ntfr<2>(0<2))}. 

Now 


0,J,[(f(i)  -  5<2))/(m  -  n)]  =  *<2>(£<2))  +  [K<2)  -  S<2>)/(m  -  »)]  -  V^<2>tf<2))  +  o(m-‘) 


a.s. 
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Observe  that 


=  42)  +  (d7)  -  v#(,^{2)(tf  >))  •  j(&{o]  -  d2]) 


=  *‘2) 


where  J(6 ^  -*  ff1)  is  the  Jacobian  matrix  of  the  mapping  6\ 


f(2) 


(2) 


So 


(m  -  n)^(2)[(£(2)  -  Sl2))/m  -  n]  =  (m  -  n)<^(2>(^2))  +  (n^2)  -  +  o(l)  a.s. 

=  m*W(({o])  ~  »[fM2)  ~  ^(W)]  +  "(I)  *■«.. 

Substituting  the  equation  above  into  (4.1)  completes  the  proof. 

Before  we  investigate  the  nonlinear  boundary  crossing  problem  let  us  consider  the  linear 
problem  first.  Suppose  we  want  to  find  the  excess  over  the  hyperplane  U  =  {x  :  7  •  x  =  e}  by 
the  d-dimensional  random  walk  S„  in  the  norm  direction  7,  where  7  is  a  d-dimensional  vector 
satisfying  JS(7  •  Sj)  =  7  •  jj  >  0. 

The  problem  above  can  be  converted  into  a  one-dimensional  problem.  In  fact  the  first 
time  S„  crosses  U  is  the  same  as  the  first  time  the  one-dimensional  random  walk  7  S„  crosses 
the  constant  level  e.  Moreover  the  excess  in  the  normal  direction  by  S„  is  exactly  (7  •  Srt  -  c) 
where  Te  —  inf{n,  7  •  Sn  >  c}. 

The  next  theorem  relates  the  excess  over  the  linear  boundary  of  a  conditional  random 
walk  to  that  of  an  independent  random  walk. 


Theorem  2. 


>("») 


li®,  -ST'-c<x}  =  P{0jv)  }{7  •  STt  -  c  <  x}. 


Moreover  the  equality  above  still  holds  when  e  is  allowed  to  depend  on  m  in  such  a  way  that 
0 (m1/2-**)  =  e  — *  00  where  a  is  any  positive  number  less  than  1/2. 

Proof:  Clearly  Pj™j{Te  <  m^1-0^2}  — >  1  as  m  — *  00.  So  for  any  given  e  >  0  we  can  find  mj 
such  that  Vito  >  m  1  the  inequality  below  holds: 

•  St,  -  c  <  x;  Tt  <  m'l-“)/2}  -  Pjjb  •  STe  -  e  <  «}|  <  e. 


Chapter  4 


Nonlinear  Renewal  Theory  for  Conditional  Random 
Walks 


Here  we  study  the  asymptotic  distribution  of  excess  over  the  boundary  by  a  conditional 
random  walk.  The  notation  used  here  is  consistent  with  that  of  Chapters  1,  2,  and  3,  except 
where  stated  otherwise.  We  begin  with  the  following  lemma  which  shows  that  as  the  time 
of  conditioning  becomes  remote  a  conditional  random  walk  behaves  like  an  independent  one 
locally.  Let  ■  m~l,  <f>W(x)  =  supj<i)_o[0'z  -  ^(0)],  rf>^(0)  =  ^[(O,0^2)].  For  the 

sake  of  simplicity  write  8^  for 


Lemma  1.  If  n  =  ofm1/2)  then 


lim 

m— oo 


dp(m ) 


dP 


(S±,  •  •  • ,  Sn)  —  1  a.s.  P. 


(o,»i *’)• 


Proof: 

=  [/(so  -m  -  sl)-./(sn  -  5,-0/ -  sP)/j2W*>)  ] 

■  [/(0.9(j))(5i)  •  V<*’)(52  "  Sl)  ‘"/(o.O(5"  - 

=  /i2i„(^(2)  -  S<?>)  •  (/<.2>tf<2))  •  exp {^2>  •  -  «^W(#«)}ri. 


By  Proposition  1  of  Chapter  2  the  quantity  above  tends  to 

[m/(m  -  r»))rf»/2 W *2> ( ^2) ) 1 1/2 1? (2) [( ^ (2)  ~  S<2>)/(m  -  n)]|-‘/2 

•  exp{-(m  -  n)^(2*[U(2)  “  S,*2,)/(m  -  n))  (4.1) 

•  exp[m^(2,(^2))]  •  exp {-SW  ■  8^  +  ni>W{$(2))}. 

Now 


*{i)l(t{2)  ~  S<2))/(m  -  n)\  =  ^«2>(fi2))  +  [(nff  -  S<2>)/(m  -  n)J  •  V*<2>(ef)  +  o(m“») 


a.s. 
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exist,  we  have  r/m  — *  t(0  with  P^j  probability  1,  and  limm_oo  P^j  { r  <  m  -  mo}  =  1. 
Combining  all  the  results  above,  we  find  that  (3.4)  is  approximated  by 

(l-teo)(',/2|?tfo)|l/2-  (3-5) 

where  Rm  =  (m  —  r) A  -  o  is  the  excess  over  the  boundary  at  the  stopping  time  r. 

To  finish  the  approximation  we  need  to  identify  the  excess  over  the  boundary 

by  a  conditional  random  walk.  It  seems  to  me  that  this  topic  has  not  been  treated  in  the 
literature  before.  In  the  next  chapter  we  shall  study  this  topic  and  give  some  applications. 
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where  is  the  expectation  corresponding  to  the  conditional  probability  pj*J. 


■'((») 

By  Proposition  1  of  Chapter  2  we  have 


£g* - h(^)fK^)l 

“p  {~(m  ~ T)  [*  ( ~  *•  ({<1 -t  ’)] } 


-1/2 


=  (2xm)<,‘/2|?tf0)|X/2  •  |?(5)(ff))r1/2emA(«°) 

where  is  the  covariance  matrix  of  sj2^  under  PM. 

Substituting  (3.2),  (3.3)  into  (3.1)  we  have 

1*"  f’  ■  *"«? K^)l 

exp  |a  -  (m  -  r) A  -^r  )]  ’  r  -  m  ~  m°|  '  exp(-m(oo  -  A(£o))j- 
It  is  not  hard  to  show  that  with  high  probability  s\2^  ~  r  •  under  Pj™},  so 

$(2)  (^T~)  -  *(2)  (^T7^)  =  ^(2,(tf)). 


(3.2) 


(3.3) 


-1/2 


(3.4) 


Similarly, 


assumes  rfm  -*  tf0  under  Pj™J. 

For  those  (o  such  that 

t(g  =  inf  {  t :  0  <  t  <  1  —  mo/m,  (1  —  t)A 


=  inf  jt :  0 


(&*M 
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Define 


/W2>) 

where  A0  =  (A^.o)  where  A^1*  6  JZrfl,  o  is  the  sero  vector  in  R !**,  di  +  =  d,  /m*  is  the 

density  for  si2*  under  Pq  where 

S„  =  (S^.S^),  S^eR*',  S™  6  Rdi. 


Now  the  likelihood  ratio 


dQ 

dP(t 


■Q  ,c  s  \  -  ..  9  i/">U-A°)  m  -  /MtfHsP) 

dp^)(sn,  -  fumfn(Sn)- 


Observe  that 


(2) 

is  the  conditional  density  of  Sm  given  Sm  . 

The  Q  distribution  of  S„,  n  —  m,m  —  1,  ••  •  running  backward  from  Sm  =  m£0  is  the 
same  as  the  conditional  Pq  distribution  of  m£o  —  Sn,  n  =  0, 1,  •  •  •  running  forward  and  tied 
down  at  =  mtfK  Under  the  reverse  time  scale  T‘  is  a  stopping  time  so  the  Wald 
likelihood  ratio  identity  gives 


f  <fP^TO*  1 

P^m)(r  <m)  =  P^m)(r*  >  mo)  =  Eq  l  St ;■  •  • ,  Sm);  T*  >  m0  > 

q  i  /J5(s£>)  MO  '  -  "J 


Let 


r  =  inf  |n  :  (m  -  n)A  ^  ^  >  a,  n  <  m  -  mo  | . 

It  is  easy  to  see  that  the  distribution  of  T*  under  Q  is  the  same  as  the  distribution  of  m  -  r 
under  P^“|(.A)  =  Po(A  |  Sm*  =  (W).  So  the  expectation  above  can  be  replaced  by 

£i"lf  ,/’",(£~5'1  '£  ■»-*»}  (3.1) 


Chapter  3 


The  Composite  Null  Hypothesis  Case 


In  this  chapter  we  consider  the  more  difficult  problem  of  composite  hypotheses.  The  null 
hypotheses  considered  in  this  chapter  are  of  the  form 

H0  :  9  e  Q0  =  {9  :  *(1)  »  0}, 

where  0  =  (/(»)  =  (*,, .  ■ . ,  64i)  €  0 W  =  (#rfl+t,  •  •  •  ,#*),  dx  +  d2  -  d.  So  6™ 

plays  the  role  of  nuisance  parameter  here.  The  stopping  rule  corresponding  to  this  case  is 
given  by 

T  =  inf{n  >  mo;  r*A(S„/n)  >  o}  where  A(z)  =  4>{x)  —  4>o(*)- 

In  the  case  of  the  composite  null  hypothesis  the  most  tricky  part  of  both  the  forward  and 
backward  method  is  to  find  a  measure  Q  such  that  its  likeihood  ratio  with  respect  to  the  prob¬ 
ability  measure  under  the  null  hypothesis  is  a  simple  function  of  the  stopping  rule  (asymp¬ 
totically).  For  the  forward  method,  the  Q  measure  is  taken  to  be  the  mixture  of  Py  over 
a  submanifold  N  of  the  parameter  space  0  e.g.  Q  —  JN  Pfda n{0)/  J  d<7jv(0),  where  d0>r(0) 
is  the  differential  element  of  the  manifold  N,  see  Lalley  (1983)  for  details.  For  the  back¬ 
ward  method,  which  we  will  develop  here,  Q  is  defined  by  randomizing  the  starting  point 
of  the  sufficient  process  S„  according  to  a  conditional  distribution.  To  be  more  precise,  de¬ 
fine  r  =  sup{n,  n[<j>(S„/n)  -  *«($„/»»)]  >  a)  and  P$(A)  =  P{A  |  So  —  A,  Sm  -  0, 
Pjm*(A)  =  P(A  |  Sm  =  £).  It  is  easy  to  see  that 


dPff 

jpH  ^S" 

aro,( 


'  »Sm)  — 


fn(Sn-X)fmU) 


I  .  .  .•v^vv 


'.V, 
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Remark  1.  The  change  of  variable  (2.12)  amounts  to  saying  the  backward  and  forward 
process  hit  the  boundary  at  the  same  time.  This  interpretation  helps  us  find  the  appropriate 
change  of  variable  in  more  complicated  situations.  See  Chapter  5  for  an  example  of  the 
composite  null  hypothesis  case. 

Remark  2.  The  relation  that  the  increment  of  forward  and  backward  processes  have  the 
simple  likelihood  ratio  e 1  is  not  accidental.  See  Chapter  5  for  another  example. 

Remark  S.  Theorem  2  has  the  following  merits  as  far  as  numerical  computation  is  concerned. 
It  relates  the  excess  over  the  boundary  of  the  forward  process  and  the  backward  process  in 
such  an  elegant  way  that  no  matter  what  method  you  use  you  only  need  one  program  to 
compute  the  excess  over  the  boundary. 

Remark  4.  Theorem  2  also  implies  that  if  we  ignore  the  excess  over  the  boundary,  in  general 
the  forward  and  backward  methods  will  not  give  the  same  approximation. 
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Proof:  By  (2.15) 

lim  E{exp[-(Ur.  -  «))}  =  (£Wr+)-l[l  -  E(e~u'+)],  (2.16) 

0—00 

where  r+  is  the  first  tine  the  corresponding  random  walk  is  positive.  When  there  is  possibility 
of  confusion,  we  also  write  r+  to  indicate  which  random  walk  we  refer  to.  By  Wald’s  lemma 

WrJ-Mr-Er?.  (2.17) 

By  the  duality  lemma 

Erl  ==  P{r"  =  oo}-1.  (2.18) 

where  rff  is  the  first  time  the  corresponding  random  walk  is  nonnegative.  Wald’s  likelihood 
ratio  identity  gives 

P(r+V  <  oo)  =  E{c~v'+).  (2.19) 

Substituting  (2.17)-(2.19)  into  (2.16)  we  have 

lim  £?{exp[-(I7r.  -  o)J}  =  P{r£  =  oo}P{r+v  =  oo }/fiY.  (2.20) 

a—oo 

Arguing  exactly  the  same  way  we  obtain 

lim  £{exp[-(Vr.  -  a)]}  =  P{r+y  =  oo}P{rr  =  oo}/iiz.  (2.21) 

•“* *oo 

By  obvious  scale  property 

P{t+u  =»  oo}  =  P{r ?  =  oo}  (2.22) 

P{r+V  =  oo}  =  P{rl  =  oo}.  (2.23) 

Dividing  (2.20)  by  (2.21),  using  (2.22)  and  (2.23),  we  get  the  desired  result.  This  completes 
the  proof  of  Theorem  2. 

It  is  routine  to  check  that 

ttz=  W)  W) 

py  m«)-m  tm*)Y 

This  proves  (2.14).  Several  remarks  are  called  for. 
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Ey  a  nonlinear  renewal  theorem  of  Lai  and  Siegmund  (197?) 

Km  E${exp[—T<f>(STlT)  -  a)}  =  [£?(t/,+)]-1  1°°  e~'P{UT+  >  x}dz, 

m— oo  Jo 

I 

where  Y  =  <f>W(0))  +  (Xi  -  il>’(6))6  —  6Xi  -  Xi  is  distributed  according  to  Pg  and 
Un  —  Z)?=i  Yi  Yi  i-i.d.  and  each  Y,-  has  the  same  distribution  as  Y.  Integrating  by  parts  gives 

(E  UrJ- ■  /°°e"*P{£fr+  >  x}dx  =  (BU,Xl[l  ~  E{c~U,+  )]•  (2-15) 

Jo 


The  increment  Z  of  the  random  walk  V„  in  Theorem  1  has  the  same  distribution  as 

VK{„(t{„,0)  •  (1M  -  (1(1  -  .)*  (&)  1(1  -  0*  (f~) 

-  -  ['  (r^;) + rV  (r^;)  -  **  (i^a)] 

=  -iww) + twine))  -  xjwm  ^  (2.H) 

=  -(**! -*(*)) 

where  we  have  used  the  identities  =  W(9)  -  j>(0)  and  ^'(^'(0))  =  6  in  the  third 

equation  above. 


The  likelihood  ratio  of  Y  with  respect  to  —  Z  is  equal  to 

Mu)  _  txp[o{^)  - 

F-z(v ) 

The  following  theorem  is  all  we  need  to  complete  the  program. 


=  t*. 


\ 

Theorem  2.  Let  Y\,  Yj,-  ■  •  and  Z\,Z-x,-  ••  be  two  sequences  of  independent  identically  dis¬ 
tributed  random  variables.  Also  let  U„  =  ]C"=i  Yi,  Vn  —  S?=i  H  the  likelihood  ratio  of  Y, 
with  respect  to  -Z,-  is  equal  to  er  then 


liPo—oe  g{exp[-17r.  -  o)]}  _  ftz 
lim.^oo  £{cxp[-(Yr.  -  o)]}  py 


where  r.  is  the  first  time  the  corresponding  random  walk  exceeds  a,  px  =  EX i  >  0,  py  — 
EY\  >  0. 


) 

» 

1 

i 
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Expanding  the  likelihood  ratio  ^  about  8  and  using  Laplace’s  method  we  have 

=  J  exp[^(tf)  -  4(0)](2 x)-l^d8 

~  exp [t„(8)  -  Ai(0)]exp[4»(^)(^  -  8„)i/2](2x)~1^d$ 

~  expMS„/n)]/|£,(*n)|1/J. 

Substitute  (2.3),  (2.4),  (2.6)— (2.8)  into  (2.2).  We  have 
Po{mo  <  t  <  m,  if>(Sm/m)  <  e/m} 

~  f  ^  ■  a'^c— 

By  Corollary  1  the  backward  method  gives  us 


(2.8] 


(2.C) 


Po{"»o  <T  <  m,  <l>[Sm/m)  <  e/m}  (2-10) 

~  (m/lx)*/*,-  f  I?  ( I''72  (1  - 


.  4(4o/i-*<0)£»»o} 

To  show  (2.9)  agrees  with  (2.10),  let  us  make  the  change  of  variable 

fo 


=m- 


By  (2.1)  and  (2.5)  v:e  have 


l-Uo 
it  =  i  -  Uo 


(2.11) 


(2.12) 


Now 


±i  =  d  m  =  -nWV) 

dd  *  d8J{8, 0)  W(0)-i/>{$))2' 


t  =  ±,,m  .  rm + mp  =  rm  [■  - 

Substitute  (2.11),  (2.12),  (2.13)  into  (2.10).  We  have 
Po{mo  <T  <  m,  <f>(Sm/m)  <  e/m} 


(2.13) 


~  (  (J^Ly1 

JtWWKt/m,  /(#, 0)<uo}  \J 


1  - 


HV) 


MV)) 


dd. 


To  show  (2.9)  agrees  with  (2.10)  it  is  sufficient  to  prove 


1  - 


$il>'(8) 


jkiMiaw 


=  Hm  Ps{exp[-(T^(5x/T)  -  o))}. 


(2.14) 
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EsMfeJM)  =  (h  -  HiWx)  ~  (HO i)  "  W*)),  also  let 

1(0)  =  E,  =  *”(0)  =  Var,(*,) 

be  the  Fisher  information.  Now  define  Q(A)  =  Ps(A)(2r)~l^dd.  Then 

Po{mo  <  t  <  m,  <l>(Sm/m)  <  e/m }  =  Eo{Po[4(Sm/rn)  <  e/m  |  £r);  m0  <T  <m} 


where  £t  is  the  o-field  generated  by  all  events  A  satisfying  API  [T  =  n]  €  T(X •  •  ■  ,Xn)  (see 
Chung  (  )).  Applying  Wald’s  likelihood  ratio  identity,  the  expectation  above  is  equal  to 


eq  l[~(Su---,ST)PamSm/m)  <  e/m  |  fT},  m0  <  T  <  m| 

=  J  Et^(Sir--,Sr)P0mSm/m)<e/m\£T);  mo<r<mj(f0. 

It  is  easy  to  see  that  P«{lim„— oo  ^(5n/r»)  =  .7(0,0)}  =  1,  so 

P#{lim  a~lT  —  {•7(0,O))~1]}  =*  1. 


(2.2) 


(2.3) 


If  mo,  m  and  a  are  related  by  m_1a  ~  r]i  <  m^la  —  ijq  then  it  follows  that 

Vo 


P*(m0  <  T  <  m} 


|1  if  »?i  <  J(0,0)  < 
(0  if  J(0,0)  $ 


f?o]- 


(2.4) 


In  this  case  we  have 


P,{  lim  m~lT  =  t,}  =  1 

m— *oo 


where 


(2.5) 


i t  =  >h/J(0, 0). 

If  we  impose  further  regularity  conditions  such  that  limn_oo  n"  "I4»(^n)  — *  1(0)  holds  then 

P,(  lim  a-l[-ir(0T )]  =  1(0)/ J(9, 0)}  =  1.  (2.6) 


Taylor’s  expansion  gives  ^(5m/m)  =  4>(TXT/m)+  jjj(S„,  -5r)^'(r^r/m)  +  °p(l)  under 
Po,  so  as  m  —»  oo, 


P»{4>(Sm/m)  <  e/m  \  tT) 


(  1  if  <ttit4>'(0))  <  e/m 
1  0  if  Mte1>'(0))  >  e/m. 


(2.7) 
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Rm  tends  to  the  same  distribution  as  the  excess  over  the  boundary  by  the  random  walk 
VK((t(o,0)  •  S„  over  a  distant  constant  boundary,  where  S„  =  (1,  S„).  Now  the  ordinary 
renewal  theorem  can  be  used  to  identify  this  asymptotic  distribution.  This  completes  the 
proof  of 

Theorem  1.  Let  £  =  fora,  o)  <  a-  If  a,  ra  tend  to  oo  in  such  a  way  that  om-1  — ♦  a0 
and  if  t^0  defined  by  (2.1)  satisfies  0  <  t(0  <  1  —  mo/m  then  the  following  asymptotic  relation 
holds 

P((m)(T  <  m)  ~  (1  -  t<0)-‘"2|?(fo)|1''2  •  |?  (y^)\  1/2«Pl-”i(«o  -  ^(eo))]^-!^) 

where  i/_(fo)  =  [^(V'r+)]~1  /0°°  c~*P{VT+  >  x}dx,  Vn  —  £"=1  are  i-i.d.  and  have  the 
same  distribution  as  VK(0(t(0, 0)-(l,  Xi),  X\  is  distributed  according  to  J*o,  and  r+  =  inf{n  > 

0,  v„  >  0} 

Corollary  1. 

Po{T  <  m,  m<f>(Sm/m)  <  c} 

~  (m/ 2v)rf/2  /  (1  -  tfo)-l/2M£o)d£o  •  e- 

where  cq  =  em~l . 

Corollary  2. 

Ps{T  <  m,  m<f>(Sm/m)  <  c} 

~  (m/2v)<,/2exp{-m(o0  -  ^(0)]}  f  (1  -  t (0)~l/2V-((0)em,'(od(o  ' 

«'{^(fo)<eo.O<lo<l-"»0/m} 

The  proofs  of  Corollary  1  and  2  are  omitted  here.  In  principle  it  follows  from  integrating 
the  result  of  Theorem  1.  Corollary  2  needs  further  simplification.  The  actual  procedure 
may  depend  on  the  probability  model  under  consideration,  but  the  arguments  in  Chapter  5 
may  provide  a  clue.  Next  we’ll  derive  Corollary  1  in  the  one  parameter  exponential  family 
using  the  forward  method,  then  show  that  the  forward  and  backward  methods  give  the  same 
result.  Let  J (61,62)  be  the  Kullback-Leibler  distance  between  ft,  and  /#,  i.e.  J (61,62)  = 
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By  Wald’s  likelihood  ratio  identity 


P${7  STc-c<z;  Te<mll-V*} 


ip <m) 

-jp^ - (5i,  —  lSnii-*)/i);  7  -STt-e<x,  Te<m^~a^i 


Applying  Lemma  1  and  Scheffe’s  theorem  (see  e.g.  page  184,  Billingsley  (1979)),  the  expec¬ 
tation  above  can  be  made  arbitrarily  close  to  P.  4(»k{7  •  Stc  -  c  <  x,  T,  <  which 

(0**0  ' 

in  turn  can  be  approximated  by  P ^  •  Stc  —  e  <  x}  when  m  is  sufficiently  large.  That  is 

|P$  {7  •  STt  -  e  <  x)  -  Stc-c<  x}\ 

<  l^”){7  •  STe  -  c  <  x)  -  P<${7  •  ST,  —  e  <  x,  Te  <  m*1-)/2}! 

+  I^Sb  •  %  -  e  <  x,  re  <  m*1-)/2}  -  P^h  -ST'-e<x,Te< 

+  |P(m.*,){7  STc-e<x,Tt<  m<2-“)/2}  -  ^,,,,{7  •  -  e  <  x)\ 

<  3c 

for  m  sufficiently  large.  This  completes  the  proof. 


I 

1 


I 

1 

1 


Although  the  theorem  above  gives  the  asymptotic  distribution  of  excess  over  a  local 
linear  boundary,  the  problems  of  interest  require  that  c  =  0(to).  For  this  case  a  “restarting 
argument”  is  needed.  The  restarting  argument  can  also  be  used  to  determine  the  asymptotic 
joint  distribution  of  stopping  time  and  excess  over  the  boundary.  Before  we  get  into  this 
let  us  find  the  asymptotic  marginal  distribution  of  the  stopping  time  first.  In  Lemma  4  and 
Theorem  5  below  the  setting  are  the  same  as  Theorem  2  except  that  e  =  com  for  some  Co  >  0. 
We  will  need 


Proposition  3  (Borisov,  1978):  Let  V,  i  =  1,2,-  ••  be  a  sequence  of  i.i.d.  random  variables 
such  that  EYi  =  0  Var  Yj  =  1  and  the  moment  generating  function  E(e,Yl)  exist  in  a 
neighborhood  of  zero.  Also  let  Vn  =  £"_j  Yf,  W„(t)  be  the  random  polygonal  curve  with 
vertices  at  (k/n,  Vn/y/n),  that  is 


Then  the  distribution  of  W„(t)  conditioned  on  W„(l)  =  0  converges  to  the  measure  generated 
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by  the  Brownian  bridge  W°(t)  =  W(t)  —  tW(  1)  where  W(t)  denotes  the  standard  Wiener 
process. 


Lemma  4. 


Hm  P$ 

m— oo  C'  ‘ 


Tc-c/tf 


(2) 


<  X 


=  $(*) 


where  /42*  =  7  •  (0,  £q2')>  ffo  ~  V*Tt[a)('1 4  $i)  is  the  variance  of  7  •  St  under  P#(0).  &{x)  is  the 
distribution  function  of  standard  normal  distribution. 


Proof:  First  observe  that  it  is  sufficient  to  prove 


lim  pjm) 


1 


<  * 


> 


$(2) 


where  p-,  =  7  •  Because  P^™j  may  be  obtained  from  P^m'  by  integrating  out  and  by 
Proposition  1  of  Chapter  2  under  Pq  =  fM/m  is  degenerated  at  zero  this  is  true  because 
in  Chapter  we  have  assumed  that  p( 0)  =  0.  Now 


fhTc  ~  *  _  lhTc  -tStc  TSTe-c 


7  •  St,  -  e  is  the  excess  over  the  boundary  at  the  stopping  time  Tt.  Since 


{7  •  ST,  -  c  >  m‘}  C  { Tt  t  ((1  -  S)t,(  1  +  i)t)}  U{^^ma x  7  •  Xn  >  mf) 

where  t  —  e/p1. 

^((m){7  STt-c>  m<}  <  P^{Te  *  [(1  -  6)t,  (1  +  «)*]}  +  max  j+<)f  7  •  Xn  >  m‘} 

By  Proposition  3  limm_ao  P^{Tt  £  [{1  -  S)t,(  1  +  S)t]}  — »  0  and  by  Lemma  8  below 

Pim*{  max  |7  •  X„\  >  m'}  <  2£tPimU|7  •  Xi\  >  mf}  -*  0  as  m  — *  00. 

So  7  •  Sj>,  -  c  =s  op(m‘)  for  V«  >  0.  This  shows  that 

P-,Te  -  e 

ij  (}-%)■ 


'> * V- >>y*  Iv" •'  S  v-V'*/' 
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has  the  same  asymptotic  distribution  as 


IhTt  -tST' 

K1  -  ft) 


Let  Wn  =  7  •  S„  -  (7  •  £0 )n/m.  Then 


fhTc  -  tSt e 


<x)  =  P 


<  x  I  Wm  =  0 


Notice  that 


{\WTt  -  W,  |>  em1/2}  C  {Tt  [(1  -  «J)t,  (1  +  5)il)  U  {(1_i}|““j1+<)p  \W"  ”  Wt\  >  £ml/2} 


{  max  | W„  -  Wt\  >  cm1/2}  C  {  max  \W„  -  j m x/2}. 

So 

P{|^  -  Wt\  >  em‘/2  I  Wm  =  0>  <  P{r*  i  [(1  -  6)t,  (l,)t]  |  W„  -  0} 

+ 'VuSSh* >  s",/’  "v-=°> 

We  have  shown  P{Tc  $  [(1  -  S)t,  (1  +  5)t]  |  Wm  =  0}  — ♦  0  as  to  — ►  oo.  By  Proposition  3 
p{.  ... . >  |"»1/2  |  =  0}  -*  P{  sup  >  |} 

which  tends  to  zero  as  S  — ►  0. 

This  shows  that 

WTc  _  WTc  -  W,  [  FT, 

v/m  v/ro  ^/to 

has  the  same  asymptotic  distribution  as  but  by  Proposition  3  has  asymptotic  dis¬ 
tribution  N  (o,<T(  (l  -  This  proves 

lim  Piw)  I  <  *|  .  *(*). 


MIU  A  f 
*—•00  ' 


V 


This  completes  the  proof. 
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Theorem  6. 


=  *(*)  Km  P(o4mj{7  '  St,  -  e  <  y) 


Proof:  Define  T‘  =  inf{n,7  ■  Sn  >  c  -  m1/8}.  Then 

{T  -  T1  >  cm2/6}  C  {  max  7  •  Sn  <  e}  C  {  max  7  •  S„  -  iSr>  <  nx1^8}. 

0<»<r+<mV»  "  rsn^r+cm*/* '  ' 

So 

P(^{T  -  T'  >  em2^}  <  P<#{  max  7$»  -  7  ■  Sr  <  m1/3} 

*  5  r#<n<r#+tm1/* 

=  ‘  Sn  -  ml/^  -  Pf<*>*7  *  5I‘mV‘l  -  ml/S^ 

(  dp$  \ 

“  I  5jp  7,7  Sl<mV^); 7  •  <  m1/3  > 

^  (0.*0  )  J 

^  Ji?oopo,^»u<*>)){7‘ V*'*i  *  ml/s> 

by  Lemma  1  and  Shefte’s  theorem. 

It  is  clear  that  ^0<(*)j{7  •  5:  m1/8}  — ►  0  as  m  -*  00  by  the  strong  law  of  large 

numbers. 

The  argument  above  shows  that  IT  -  T'\  —  Opfm1/2)  which  implies 


Now  the  idea  is  to  restart  the  process  from  Sy».  Define  T"  =  inf{r»,7  ■  5n+r'  >  c)  and 
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Ri  -  {(«!,  •  •  ,*<);r(l  -  6)Um  <  V  <  T'(  1  + 


*  =  1,  •  •  • ,  d}  where  (  -  (&,  * •  • ,  fr), 


s 


<  z  > 


Then  the  probability  above  equals 

Jim,  [  P$;*{lST*  -  (c  -  7  •  $r)  <  F>P${r'  €  du,  s£>  €  dv} 

-  Jim,  /  ^“,“UJ{7Sr-  -  (c  -  Tf  •  Sr)  <  y}P${r'  €  du,  sj?  6  dv} 

since  limm_O0  P^m*{5r  €  P*}  =  1. 

Now  by  Theorem  2 


{•y  -  ST>c  -  (e  -  7 ST.)  <  v}  -  Km  “  c  <  V>l  <  £ 


on  the  set  A,  D  Rt  for  6  sufficiently  small.  This  implies  if  we  first  let  m  oo  and  then  6  -*  0. 
Then  we  have 


■  Ss.  V»«r>i(l  -  - e  <  *>  ,<’1  (-*■> 

=  ♦(')  •  ,!i“  •  Sr.  -  C  <  »} 


This  completes  the  proof. 


The  next  theorem  is  the  main  result  of  this  chapter.  It  is  also  the  result  we  need  to 
complete  the  approximation  in  the  previous  section.  Let  Tm  —  inf{n  >  m0,  mH(S„/m)  >  0}. 
From  here  to  the  end  of  the  proof  of  Theorem  6  S„  will  be  defined  as  Sn  —  X,-  where  X,  = 

(1,  X,).  X,- 1  =  1,2,  -  -  is  an  i.i.d.  d-dimensional  random  vector  sequence.  The  distribution  of 
X,-  can  be  imbedded  in  a  d-parameter  exponential  family.  The  reason  for  defining  5n  in  this 
way  is  to  include  a  more  general  stopping  rule  which  is  needed  in  applications.  Before  stating 
the  theorem  we  list  some  assumptions  which  are  easy  to  check  when  applying  the  theorem  to 


■’*'•1'* ■’■'v / 


-.-r-.- V 
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a  particular  example,  but  it  is  tedious  to  impose  conditions  on  the  function  H  which  imply 
these  assumptions. 

(I)  3to  >  0  such  that  limm—*,  P^j  {|^f  ~  *o|  >  e}  =  0  Ve  >  0. 

(II)  H  is  continuous  in  a  cone  C  containing  the  line  of  drift  (l,/i(0, 

(Ill)  H  has  continuous  first  partial  in  a  neighborhood  of  top  and  i-p  0  where  7  =  VH(top). 

Let  N  =  {(2j,---,zto+i)  :  H(xi, •  •  • , zj+i)  <  0}.  If  assumptions  (II)  and  (III)  are 
true,  then  by  the  implicit  function  theorem  there  exists  a  continuously  differentiable  func¬ 
tion  /(«|,  ■••,xj)  satisfying  H(xlt  ,xj,  g(xi,  •  •  - ,  xj))  =  0  V(xi,  •  •  • ,  xj)  belongB  to  some 
neighborhood  0  of  (topi ,  •  •  • ,  toMd)  and  g(topi,  •  •  • ,  topj)  =  toMrf+i • 

Let  7  =  {(zj ,  •  •  • ,  xj+i)  :  xj+i  —  g(xi,  •  •  • ,  xj)}  be  a  d-dimensional  surface  which  is  well 
defined  on  0  and  extend  in  some  smooth  way  to  RJ. 

(TV)  Boundary  (M)  nC  =  Cn  7. 

Now  we  are  ready  to  state 
Theorem  0.  Tf  assumptions  (I)-(IV)  hold  then 

fJj®0 P$ imH(sTm/m)  <x}=  Urn  P(M,,))h  ST.-c<  x). 

Before  we  go  any  further,  let  me  make  some  remarks. 

Remark  1:  The  problem  is  invariant  under  rotation  so  we  may  choose  the  coordinate  system 
such  that  p  =  (0,0,  ••  •  ,0,  ||/t||). 

Ren:ark  2:  The  limit  on  the  right  hand  side  of  the  equation  above  can  be  determined  by  the 
ordinary  renewal  theorem. 

Although  the  proof  of  Theorem  3  is  very  complicated  and  technical,  the  basic  strategy 
is  not  difficult  to  explain. 

It  will  be  shown  that  to  determine  Iimm—oo  <  x)  is  equivalent  to 

finding  the  asymptotic  distribution  of  excess  over  the  hypersurface  m7  by  the  conditional 
random  walk.  In  Lemma  4  below  we  shall  exhibit  a  surface  mT^  which  is  “near*  m7.  The 
idea  is  to  condition  on  the  position  of  the  random  walk  at  the  first  time  it  crosses  m7^  then 
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restart  the  random  walk  from  there.  Lemmas  5  and  6  below  amount  to  showing  that  the 
restarted  random  walk  will  cross  m7  in  a  time  of  magnitude  o(my/2).  Now  we  can  simplify 
the  problem  in  two  directions.  First,  by  Lemma  1  within  the  time  of  magnitude  o(m*/2) 
the  conditional  random  walk  behaves  like  an  independent  one.  Secondly,  since  the  restarted 
random  walk  will  cross  m7  within  time  o(m}!2)  the  only  important  part  of  m7  is  a  set  of 
diameters  o(m1/2),  within  which  mT  behaves  like  a  hyperplane.  (To  every  man  on  the  earth, 
the  earth  is  “flat”).  Finally,  an  argument  like  the  proof  of  Theorem  2  can  be  used  to  finish 
the  proof.  The  proof  of  Theorem  3  is  preceded  by  three  lemmas. 

Lemma  7.  Let  =  m7  -  Jbp/||/i||  where  k  is  a  positive  number  which  may  depend  on 
m.  There  exist  aj  >  >  0  such  that  qj k  >  d(m7  H  C,  m7 M  fl  C)  >  auk. 

Proof:  Let  M  =  supjea  ||V^(r)j|,  <  =  (2Af)~1.  (If  M  =  0  then  g  is  constant  over  0,  and 
Lemma  I  holds  trivially.  In  fact  d(m 7  OC,  m7^  H  C  =  k).  Now 

d(m7  n  C,  m7(4)  PI  C)*  >  ^  -  y)  -  *m/M|2) 

=  # J^nc(m*||*  -  Vll2  -  2 mk{x  -  y,  #*/l!/*H>  +  *2)- 

Note  that 

m7^z  ~  idl2  -  2mi(x  -  y,  m/IIm!!)  +  *2 

=  ****[|l(*i»  *  •  •  »**)  -  (vit  •’  •  ,iw)  ||2  +  fy(*i,  •  ••,*<*)-  g{v  i,  •  •  • ,  im)I2] 

(4.2) 

-  2mk[g{xl<-  •  • ,  a*)  -  y(yx ,  •  •  • ,  y<*)]  +  *2 

=*  "*2||(*i.  •  *  • ,  *d)  -  (Vi,  •  •  * ,  W)ll2  +  [m(y(xi,  •  •  • ,  a*)  -  g(yu  •  •  • ,  yd))  -  k]2 
where  the  first  equality  follows  from  (m/IImII)  •  *  =  y(x i>  •  •  • ,  xj).  Let 

<i  =  inf (||m(z  -  y)  -  x,y€  7  DC,  ||(*i, •••,**)-  (Vi, Vrf)||  >  tk/m} 

ei  =  inf{||m(x  -  y)  -  *m/||mIIII;  x,y  €  7  HC,  ||(*i, •  •  • , xd)  -  (|fi,- -- > V«r)||  <  (k/m). 

By  (4.2)  <i  >  £2i2.  By  the  mean  value  theorem 

!#(*»,■  •*.**) -#(»!,  ■••,»<)!  <  M\\(xi,---,xd)  -  (Vs,  -  •  - ,  Vrf)|| 

so  fj  >  (k  —  eMk)2  >  k2/ 4,  clearly  d(m7  PI  C,  m7^  PI  C)  >  q  A  <j.  Also  k  >  d(m7  0 
C,  m7^  n  C)  by  the  definition  of  m7^.  This  completes  the  proof  of  Lemma  1. 


SO  Chapter  4 •'  Nonlinear  Renewal  Theory  for  Conditional  Random  Walk s 


Let  xi  =  || si ||*  =  £?=  1 1*['^|-  The  following  lemma  is  true. 

Lemma  8.  limm— »  mP^ra*{||Jfi||*  >  m‘}  =  0  Vf  >  0. 

Proof:  By  sufficiency  the  conditional  probability  pj,m^  is  independent 'of  the  parameter  6,  so 
we  may  take  0  =  0,  EqX i  =  (o 

mP<"*>{||X,||.  >  m‘}  =  m  /  f(x)  •  /.-»(*  -  x)/fm(t)dz.  (4.3) 

By  assumption  sup,  |/„(s)|  <  D  for  all  n  >  no,  and  proposition  1 

MO  ~  (2jrm)”I^H:(&)r1^aq>{-i*^(&)}. 


By  (4.3)  4>(£o)  =  0,  so  for  m  sufficiently  large 

MO  >  (l/2)(2xm)-l/*|?(f0)r/*. 

Now 

/  /(*) ' /m- 1({  -  x)/M0dz  <  2B(2xms|$(£o)|)1/2  /  f(x)dx 

<  2B(2xms|$(e0)|)l/2e"WS(eA||X,II*))  VA  >  0. 

Since  the  moment  generating  function  of  si  exists  in  a  neighborhood  of  zero,  iJ(e*!lx,IU)  <  oo 
for  A  sufficiently  small. 

It  is  clear  that  as  m  — *  oo 

2P(2xms|?(^)|)l/2e-Am'  ■  ^(eA»x'H*)  -  0. 

This  completes  the  proof. 

Define 

r«  =  inf{n  >  mo  :  Sn  crosses  m/^}. 


Lemma  9.  For  any  given  6  >  0  there  exists  m*  such  that  tor  all  m  >  the  following 
inequality  holds 

P$(<Unt/3  >  d(ST^.„vmT  n  C)  >  Q»  m1/3}  >1-6 

where  a«  >  aj  >  0  are  independent  of  m  and  6. 
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Proof:  Since  with  probability  close  to  one  the  conditional  random  walk  S„  stays  in  the 
cone  C  for  n  sufficiently  large,  and  we  hare 

P€(S{||^rr./.,IU  +  m1'1  >  d(5rim,/.)(  m7nC)}>l-|  (4.4) 

MS  m/nC)  >  d(mT nC.mfnC)- >1-|.  (4.5) 

'  *m  *m  ■ 

Clearly 

>  m‘)  -  Pf(“,{||JfTr./.)||.  >  m«,  Tt''$)  <  Pm}\  <  ~ 

for  some  constant  fl  >  0  and  m  large  enough  and 
f,$<lixrir./-,ll.  >  t;'"  <  Pm) 

The  last  quantity  above  tends  to  zero  by  Lemma  5.  This  implies 

>  ">*>  <  \ 

for  m  sufficiently  large.  Choose  e  <  In  view  of  (4.4)  and  (4.5),  the  proof  is  completed. 
Now  we  are  ready  to  prove  Theorem  3. 

Proof  of  Theorem  S:  Let  =  inf{n  >  m0,  Sn  cross  ml}.  Since  the  conditional  random 
walk  S„  will  stay  in  the  cone  C  with  high  probability  Vn  >  mo,  by  assumption  (IV)  T£,  =  Tm 
with  high  probability.  By  Lemma  6  the  conditional  random  walk  starts  at  5  will  cross 

ml  in  a  time  that  is  Op(m1/,J).  Now  the  idea  is  to  compare  mT  with  its  tangent  plant  at 
mtop  in  a  neighborhood  with  diameter  that  is  0(m1/'s)  via  a  Taylor  expansion 

mf  (-')  =  m/( 0)  +  V/(0)'  ■  h  +  —h!  •  V2/  (s—')  ■  h 
\m/  m  \  m J 

where  j|/i||  =  0(m‘^s),  0  <  6  <  1,  so  the  conditional  random  walk  starting  at  5  (m»/*j  crosses 

Ji m 

mT  as  if  it  crossing  a  hyperplane  HP  which  is  0 (m1/*)  away, 

HP  =  {z;  x  •  7  =  cm}  e  -  m  =  0(mt//3). 
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Sn  crosses  HP  when  Sn  •  7  crosses  the  constant  level  Cm.  The  excess  of  interest  turns 
out  to  be  the  excess  in  the  normal  direction,  in  the  case  of  a  hyperplane  it  equals  7  Srem  ~  cm- 

Let 

A\  —  {Excess  over  mT  in  7  =  VH(tofi)  direction  <  x}. 


Then  (by  the  arguments  above) 

ji»  J$M.>  -  Jta  I  V">1} 


-  Cm  <  *]} 


where  £  *3  the  rr-field  generated  by  events  prior  to  T„ 

Tin 


=  fw  -  *r, 

1  m 

Also  let  f,j2)  =  £(2,/(”»  “  rj,m  /  ’)  A2  =  {|f,j2)  -  f'2)|  <  6,  TCm  =  0J,(m1/3)}.  It  is  clear  that 
Aj  has  probability  contain  close  to  one  as  m  00. 

Now 


^  ’[^ -«m<  «]}  -  •  5rc  -  c  <  z\\ 

<  l^!{^rn'TLm/\  •  STcm  ~  Cm  <  *]}  -  ^S{P^-rir,/,,)(7  ■  STlm  «);  At}\ 

+  -  STtm  -cm<  x);  A2) 


~  Sr‘  -  c  ^  z>; 

+  l*J${  Km  p(0 •  sr.  -  c  ^  *); 

“  Po^*>((‘*>))(7  STt-e<  *);  Aj}| 

+  {,«“■ P(o^».(««,(7  •  -  c  <  *);  A2}  -  Km  •  %  -  c  <  z)\ 


=  /1  +  /2  +  /j  +  Is  <  4e 


where  J2  can  be  made  arbitrarily  small  by  Theorem  2  and  /j  <  f  by  Scheffe’s  theorem  since 
the  density  of  tends  to  of  P(o,#(*)(e(*>)) 

Let  us  define  the  excess  Rt„  to  be  the  quantity  such  that  Sym  -  j^j Rt„  G  mT.  A  Taylor 
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expansion  shows  that 

mn(STJm)  =  mH  +  VH(i)  •  ^Rrm. 

The  first  term  on  the  right  hand  side  vanishes  by  the  definition  of  7. 

The  second  term  tends  to  the  excess  over  m7  in  the  normal  direction  of  m7  at  fito  since 
x  -*  ptQ.  We  have 

pi“i  {”**  (%0 s  *}  - ^  -  < « « ■>• 

The  proof  is  completed. 


Combining  the  idea  behind  the  proof  of  Theorem  5  and  Theorem  6  it  is  not  hard  to  see 
that  a  theorem  like  the  following  should  be  true.  The  proof  is  omitted,  since  every  relevant 
step  has  appeared  in  the  proofs  of  Theorems  5  and  6. 


Theorem  10.  Under  the  same  assumption  in  Theorem  6,  the  following  equality  is  true 

Tm  -  tom 


lim  P(”J 

m— oo  <{  1 


l  \/(1  - 

=  *(x)  tlun  P(d,«,)){7.5r,-c<v} 


<  x,  mH(SrJm)  <  y  ) 


Now  we  are  ready  for  some  applications.  The  first  application  is  the  approximation  of 
the  conditional  probability  P^(T  <  m)  in  Chapter  3.  By  (3.5) 

plm)(T  <  m)  ~  (1  -  •  WltfVU  -  tu),  1 ']r1/2 

£?JS(e-R'")exp{-m[ao-A(e0)]} 

where  fZ,*  =  (m-r)A[(£— Sr)/(m—  r)] — a  is  the  excess  over  the  boundary  at  the  stopping  time 
r  =  inf{n  <  m;  (m-n)A[(£-S„)/(m-n)]  >  o}.  In  this  case  £T(v)  =  (l-t0)A[(^o-v/l-t0) 
where  v  —  (t0,v)  €  Ri+l,  to  €  R,  v  6  R?.  By  Theorem  3 

£$(*-*’")  /o°°  '~’P{Vr+  >  x)dz  •  [E(Vr.))-1 

where  Y  =  V£T[t{o(l,  m(0>  *(Jl(£o!))))]  •  (1.  -^l).  X\  is  distributed  according  to  ^(o, #(»)(((*>)) ’ 
and  Un  =  ZU  Yi- 
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The  second  application  is  concerned  with  the  so-called  “change  point”  problems.  Assume 
that  Xi,X2,  ■  ■  ■ ,  X„,  are  independent  normally  distributed  random  variables  and  that  X,- 
has  the  mean  value  pi  and  variance  1.  Suppose  we  are  interested  in  testing  the  hypothesis 
H0  :  Pi  =••■  =  Hm  against  the  alternative  Hi  :  there  exist  k,  l  <  k  <  m  -  1  such 
that  pi  =s  •••  =  fik  pk+1  =  pk+2  =  •••  =  pms  if  Jb  where  known,  the  problem  would 
be  a  two  sample  test  of  the  equality  of  the  means  of  the  first  sample  Xi,Xt,-  •  ■ ,  Xk  and 
second  sample  X*+i, Xj+2,  * •  •  ,Xm.  For  this  problem  the  log  likelihood  ratio  statitics  would 
be  A*,m  =  k(m  -  k)(Xk  -  X*,m)2 /m  where  Xk  =  k'1  2}  X,  and  Xi,m  -  (m  -  Jb)'1  Y^T+i  xi- 
Since  Jb  is  in  fact  unknown,  the  log  likelihood  ratio  statistic  is 


max  A„  m  (4.6) 

and  the  significance  level  of  the  likelihood  ratio  test  is  the  probabiity  under  Hq  that  the 
random  variable  (4.G)  exceed  some  constant  e. 

Let  S„  —  X, .  Simple  algebra  shows  that 

hn<m  =  (5„  -  nSm/m)2/n(  1  -  n/m). 

It  is  easy  to  see  that  under  Ho  the  random  varaibles 


S„-nSm/m  n  —  1,2, 1 

have  the  same  joint  distribution  as  Si,  S2,  •  •  • ,  Sm_i  given  that  Sm  =  0  so  the  significance 
level  is  given  by 

P0(m){T  <  m  -  1}  (4.7) 


where 

T  »  inf{n  :  |S„|  >  fr[n(l  -  n/m)]1/2} 

6  =  c‘/2. 


(4.8) 


We  will  prove  an  approximation  formula  which  contains  (4.7)  as  a  special  case. 

Theorem  11.  Let  T  be  defined  by  (4.8),  and  assume  that  b  — *  00,  mo  — ' *  00,  mt  — *  00, 
and  m  -*  00  in  such  a  way  that  for  some  0  <  to  <  <i  and  pi  >  0  m,/m  — *  t,  (t  =  0, 1), 
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6/m1/2  =  Hi,  then  as  m  — ►  oo 


rpi\t0  -lj  ' 

Pj"Vo  <  T  <  m}  ~  2(S/2x)1'3e-‘'3  /  rMf  +  M?*-1 W 


where  v((  +  will  be  identified  in  the  proof. 

Proof:  Let  Q pjm^d(  ■  (2x)~1'3.  An  easy  calculation  shows  that  the  likelihood 
ratio  of  Xi,  ■  ■  •  ,Xn  under  relative  to  is 


[m(m  -  n)/n]1/2exp  [S2/2n(l  -  n/m)] 


from  which  it  follows  by  familiar  argument  that 

P0(m,{rn0  <  T  <  m,}  =  m"1  f°°  P<m){[r/(1  -  T/m)]'/Jex p  [-(l/2)S2/r(l  -  T/m)\  ; 

J  —  OO 

mo  <  T  <  mi}<ff(2T)1/2 

=  /_*  E{3  {[27(1  -  T/m )]1/2  exp  [-(l/2)S?/r(l  -  T/m)} ;  m0  <  T  <  m, }  d£  •  (2X)'1/2 

=  (2T)-^e-^  r  E™{[T/(  1  -  T/m)}1'3 
J  — oo 

exp  [-(1/2)(5|/T(1  -  T/m)  -  c)]  ;  m0  <  T  <  m}d£. 

(4.9) 

Solving  the  equation  £  =  Hi[tJ *(1  -  t^)]1/2  for  t(,  we  have 

1 

t(  ~  i  +  U/m)2' 


We  know 

W>°  (410) 

Solving  the  following  inequality  for  £,  we  have 


<i  <  miv)2  <  **  ** ■ 1)1/2  <  *  <  ^(‘oi  -  i)i/2  (4ii) 

or  ~Hi(tol  ~  1)1/J  <  Z  <  '  Hit;1  ~  1)1/J- 
It  is  easy  to  see  that 

lim  Pj^[m0  <  T  <  m)  =  1  for  £  satisfying(4.11). 

IB“*00  ' 
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Applying  Theorem  3  with  H(x,y)  =  (1/2)  -  Mi)  and  =  (t{£,  tf).  we  have 

*£f  [-(1/2)(5|(1  -  T/m)  -  e)]}  -  )p lc—/>{i7r+  >  x}dx 

Jo 

where 

Y  =  •  (X»  +  f,  1) 

=  (r4)(x'  +  ^l) 

-(<+f)h+ (i/2,0 +f)]' 

Un  =  Z)"=i  y%  and  Xi  is  distributed  according  to  the  standard  normal. 


Define 


r[B[Ut.)]-le-P{Ur+  >x)  =  vtf  +  rfr1)- 

Jo 


(4.12) 


Combining  (4.10),  (4.11),  and  (4.12)  we  obtain  from  (4.9)  that 

. 


F0(m){rn0  <  T  <  m)  ~  2(c/2x)1'ie-‘/>  °  fM*  +  firl)d( 


This  completes  the  proof  of  Theorem  11. 
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Table  5.5 

Significance  Levels  of  Group  Repeated  t-Test 


#  of  observations 

in  a  group 

a 

mo 

m 

analytic 

approximation 

Monte  Carlo 

(2000  replications)* 

2 

3.65 

8 

40 

0.050 

0.052  ±  0.001 

3 

3.6 

10 

55 

0.049 

0.049  ±  0.001 

4 

3.6 

10 

70 

0.051 

0.052  ±  0.001 

5 

3.6 

10 

80 

0.050 

0.052  ±  0.001 

7 

3.6 

15 

120 

0.047 

0.047  ±  0.001 

‘Importance  sampling  is  used  in  the  Monte  Carlo  experiments  above 
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Table  5.3 


Significance  Level  of  Modified  Repeated  t-Test 


a 

e 

m0 

m 

Analytic 

Approximation 

Monte  Carlo 

(6000  replications) 

3.8 

3.6 

7 

30 

0.050 

0.053  ±  0.001 

3.95 

3.6 

7 

40 

0.050 

0.052  ±  0.001 

4.0 

3.6 

8 

50 

0.048 

0.049  ±  0.0009 

4.7 

4.2 

10 

80 

0.028 

0.027  ±  0.0007 

5.0 

4.5 

10 

100 

0.023 

0.023  ±  0.0066 

Table  5.4 

Powers  of  Level  of  Modified  Repei^ed  t-Test 


a 

c 

m0 

m 

n 

Analytic 

Approximation 

Monte  Carlo 

(2000  replications) 

3.8 

3.6 

7 

30 

0.8 

0.952 

0.956  ±  0.005 

3.95 

3.6 

7 

40 

0.7 

0.960 

0.959  ±  0.004 

0.5 

0.717 

0.727  ±  0.010 

4.0 

3.6 

8 

30 

0.6 

0.946 

0.943  ±  0.005 

0.4 

0.613 

0.626  ±  0.011 

4.7 

4.2 

10 

80 

0.5 

0.947 

0.937  ±  0.006 

0.4 

0.779 

0.770  ±  0.010 

5.0 

4.5 

10 

100 

0.45 

0.940 

0.938  ±  0.005 

0.3 

0.553 

0.550  ±  0.001 
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Table  5.1 


Significance  Level  of  Repeated  t-Test 


a 

mo 

m 

Analytic 

Approximation 

Monte  Carlo 

(2000  replications) 

3.8 

7 

30 

0.052 

0.053  ±  0.001 

4.0 

8 

50 

0.047 

0.048  ±  0.001 

4.5 

10 

75 

0.032 

0.033  ±  0.0006 

5.0 

10 

110 

0.024 

0.023  ±  0.0004 

importance  sampling  is  used  in  the  Monte  Carlo  experiments  above 


Table  5.2 

Powers  of  Repeated  t-Test 


a 

mo 

m 

n 

Analytic 

Approximation 

Monte  Carlo 

(2000  replications) 

3.8 

7 

30 

0.8 

0.946 

0.951  ±  0.005 

0.6 

0.734 

0.742  ±  0.010 

4.0 

8 

30 

0.6 

0.934 

0.933  ±  0.006 

0.4 

0.584 

0.596  ±  0.008 

4.0 

10 

75 

0.5 

0.950 

0.948  ±  0.005 

0.3 

0.518 

0.522  ±  0.011 

5.0 

10 

110 

0.4 

0.882 

0.889  ±  0.007 

0.3 


0.581 


0.581  ±  0.011 
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where  in  the  equality  above  we  have  used  Theorem  2  of  Chapter  2  to  obtain 
Mil.  6)  =  Ml«p(2«o/(l  -  *«.))  -  1]1/2}  •  {(l/2)log(l  +  fi2)/[02  -  (1/2)  log(l  +  *2)]} 

=  ^{[«p(2ao/(l  -  tf0))  -  l]i/2>  •  [«0/(l  -  t{0)] 

•  {exp(2o0/(l  -  tfo))  -  1  -  a0/(  1  -  t(o)}~1 
and  the  equality 

Itifid  -  if.)-*.  or17*  =  2-/!  ■  (6  -  f?d  -  it.)-2]-'11 

Next  we  consider  the  group  repeated  t-test.  The  stopping  rule  we  are  interested  in  here 
is 

r*  =  inf  jn;  n  =  mo  +  tA,»  =  0,  •  •  • ,  inM^r»/ri)  >  # 

where  k  is  the  number  of  observations  in  a  group.  It  is  easy  to  see  that  rj  is  a  stopping  time. 
A  moment’s  reflection  we  find  that  the  corresponding  significance  levels  and  powers  are  the 
same  as  k  —  1,  except  the  excess  over  the  boundary  part.  To  find  the  excess  over  the  boundary 
part,  it  is  sufficient  to  identify  the  increment  of  the  random  walk  which  generates  the  excess 
over  the  boundary.  In  this  case,  using  the  forward  method,  the  corresponding  increment  of 
the  random  walk  is  17*  =  Y;  where  the  distribution  of  Y,’s  are  given  in  (5.2).  Using  the 
backward  method,  the  increment  of  interest  is  V*  =  where  the  distribution  of  the 

Zi  s  is  given  by  (5.4). 

Tables  5.1  -  5.4  below  give  some  examples  of  the  approximation  of  powers  and  signifi¬ 
cance  levels  of  both  RST  and  MRST.  For  comparison,  the  results  of  Monte  Carlo  experiments 
are  also  included.  For  details  of  numerical  computation,  see  the  Appendix. 
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What  remains  is  to  approximate 


Since  (£j,  £2)  is  located  at  the  boundary  of  the  set  A,  the  multidimensional  Laplace  method 
does  not  apply.  The  argument  below  uses  a  chance  of  variable  to  convert  the  integral  into  a 
form  which  can  be  handled  by  iterative  approximation.  The  change  of  variable  is  suggested 
by  Fig.  1. 


/  J  ex p[-m(<&0(6)  -  )]<&<<& 

=  /  [  exp[-m(^o(y2)  -  nViV22\Vi2dy2dui  where  yx  =  1/2  1/2  =  6 

Jo  J  C\ 

-  f  exp[-m(^o(v2))]y2/2  [ f  exp(mnviV22)dyi  dy2 

J°  UCi  (5-23) 

=  /  «p[-m(^o(y2))]yi/2{(mr;y2/2)~1(exp(m»72/2co)  -exp(mn:'2/2ci)]}(fy2 
Jo 

~  /  cxp[-m(^0(v2))]  •  •  exp(mrjy\l2  eQ)dy2 

Jo 

=  (mi))-1  /  e— m°<Va)  cfj/2 

Jo 


where  the  function  </(•)  is  defined  by  (5.19).  Now  Laplace’s  method  can  be  used  to  approximate 

(mn)-1  «-"»»(»>)  <f„2  ~  (m.?rl[2jr/(m0w({))]1/2em»<^. 

Jo 


By  (5.22),  (5.23) 

^>»i{mo  ^  r  <  *».  A(5m/m)  <  e/m}  ~  exp[-m(a0  +  r?J/2  +  $(&)) 

•  »MiM)  ■  n-'P  W'(fc)  •  (1  -  *«,)  ■  !?(fc(i  -  *co)"l.^)ll1/2 

=  exp(-m(a0  +  172/ 2  +  $(£2))  *  t/+{(exp[2a0(l  -  t{0)-1]  -  l)l/2}  •  exp[3a<,/(l  -  t{o)]  •  a0 

n~l  ■  {exp(2a0/(l  -  t( o)]  -  o0(l  -  tfo)'1}-1  •  l2jrm(£2  +  (l/2)e0f7^2/2)(l  “  ^»)*)l/2 

(5.24) 
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Figure  1 


Since  g"(( j)  >  0  g({ j)  has  minimum. 

Let  =  i  be  the  minimum  satisfying 

x 2  +  e0rjx  -  1  =  0  =>  x  —  {[(corj)2  +  4]1/2  -  eorj}/2 

6  =  {Z/Kcgr,*  +  4)1'2  -  Co*?]}*  =  [(cW  +  <)‘/2  +  cor?]*/ 4  (5.20) 

fi  =  eo?2/J  =  eo*_1  =  y  (co*?*  +  4)l/J  +  e0rj-  (5.21) 

As  we  pointed  out  earlier  the  only  important  part  of  the  integral  (5.17)  is  the  integral 
over  an  arbitrarily  small  neighborhood  N  of  ((ids).  Part  of  the  intergrand  of  (5.17)  is 
approximately  constant  over  N,  namely 

(i  -  t{0)_i/si?[6(i  -  tforsfcir^-uo) 

w  (1“fyo)“l/*fflfi(1“fyo)“l»fj)rl/2,/-(fo)  over  N 
where  t(o  satisfies  (-1/2)(1  -  t*0)log(l  -  [&  •  1  -  [fy0)“1]2}  =  a/m. 


(5.22) 
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^o(&)  in  a  neighborhood  of  1 

Mix)  =  ^o(l)  +  ^f(l)(^2  -  1)  +  (1/2)^M(*)(&  -  l)2 
~  (l/4)(6  -  l)2 

so 

j  e~m*°^^d(2  ~  f  exp (—mxi/4)dx  =  (Ar-fm)1^.  (5.16) 

Jlm  J-00 

Substituting  (5.16)  into  (5.15)  we  have 
Po{mo  <  r  <  m,  A (Sm/m)  <  e/m} 

~ «-< <nM'f *  /  (1  -  ie.r1/1ir(fi(i  - i)«i. 

The  change  of  variable  log(l  +  82)  =  -  log[l  —  £|(1  —  t(0)-2]  transforms  the  expression  above 
into  2e-°(a/x)1/2/j°[log(l  +  02)]-1/2i/+(0)d0  in  agreement  with  (5.1). 

Now  let  us  bring  our  attention  back  to  (5.14)  with  q  ^  0.  The  integral  to  be  evaluated 
is 

/  j  [  1  -  tf„)-1/2|?(fi(l  -  «4,)-l,6|rl/2»'-(eo)exp(-m(^o(6)  -  ntiMidti-  (5.17) 
The  set 

A  as  {0  <  t(0  <  1  -  m0/m,  A(£0)  <  c/m.} 

=  {mo/m  <  U  <  1,  (1/2)  log{l/[l  -  fc/ej'2]}  <  c/m}  (5.18) 

=  {Cl  <  <  C0} 

where  t\  —  l-t(0  and  eo  =  (1  — e"«)1/*.  ej  satisfies  (mo/2m)log{l/[l  —  (eimo)l/m]}  =  oo- 
It  is  clear  that  the  only  part  which  is  of  first  order  importance  in  (5.18)  is  the  integral  over  a 
small  neighborhood  of  (|i,  $2)  where  (||,  £j)  minimise  fo(iz)  ~  nil  over  A. 

We  now  proceed  to  identify  (&,  (2).  The  following  picture  helps  us  locate  (&, &). 

It  is  clear  from  Fig.  1  that  the  minimum  of  4(iz)  _  nil  over  A  occurs  on  the  curve 
ii/i^2  —  Co-  On  this  curve  0o(6)  “  »?£i  i*  equal  to 


(1/2 )(6  -  1  -  log  U)  -  ne0i\/2  =  g(i2). 
^ 1  -  (1/2)(1  -  it1  ~  co r,Sl/2). 


(5.19) 
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Next  we  consider  the  problem  of  approximating  the  power  of  the  modified  repeated  T- 
test.  It  is  easy  to  see  that  the  powers  of  the  modified  test  depend  on  the  parameter  fi,  o1  only 
through  r\  =  p/o,  so  without  loss  of  generality  we  may  take  o  =  1  and  p  =  q.  By  symmetry 
the  power  at  rj  equals  that  at  -rj,  so  we  may  assume  q  >  0.  The  power  of  this  test  at  t),  by 
definition,  is  equal  to 

Pn{™ o  <  r  <  m}  +  Pn{r  >  m,  A (Sm/m)  >  e/m} 

(5.13) 

=  P„{*no  <  r  <  m,  A [Sm/rn)  <  e/m}  +  P,{A(5ro/m)  >  c/m}. 

The  second  part  on  the  right  hand  side  of  (5.13)  can  be  easily  obtained  by  approximat¬ 
ing  the  tail  probabiities  of  the  noncentral  T-distribution,  so  it  is  sufficient  to  approximate 
Pn{m0  <  r  <  m,  A {Sm/m)  <  e/m}.  By  Proposition  1  of  Chapter  2  under  Pn,  Sm  has 
asymptotic  density  in  the  following  form, 


~  (s»-m)“1|?(Orl/2exp[-m(0(fo)  -  nil  +  n7/ 2)1- 


Unconditioning  (5.2)  with  respect  to  )  gives 

e-(m/2r)  ff  (1  -  t(or^lf[ii(l  -  W^V-do)  #  x 
JaJa  |f,  (5.14) 

•  exp(-m(^0(6)  -  nil  +  n7/2)]diidii 

where  A  =  {0  <  to  <  1  -  mo/m,  A{io)  <  e/m},  ^4|f,  denotes  the  cross  section  of  A  in  the  & 
direction  when  ii  is  given  and  A  =  {&,  0  <  to  <  1  -  mo/m,  A(f0)  <  c/m}.  Observe  that 
when  n  —  0  (5.14)  should  reduce  to  (5.1).  The  following  argument  shows  that  indeed  it  does. 


When  q  =  0  the  integral  part  of  (5.14)  reduces  to 


(i  -  t(0r^  i?(6(i  -  t<9r\  i2)ri/7is-(io)e-m*°«’)di2 


dii. 


(5.15) 


Since  ^o(l)  —  0  =  min,^o(z),  it  is  easy  to  sec  thta  the  integral  over  the  interval  lm  — 
(1  -  cm,  1  +  <m)  with  limm— ao  (m  —  0  constitutes  the  major  contribution  of  the  inner  integral 
in  (5.13).  For  m  sufficiently  large  (1  -  t*0)-l/2|$ (ii{l  -  &)|“1/2i'-((o)  >3  effectively 

constant  (with  respect  to  (2)  over  Im.  We  still  have  to  evaluate  /.  e~m*a^ di?.  Expanding 
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The  first  term  on  the  right  hand  side  of  (5.10)  can  be  computed  easily  by  calculating  the  tail 
probability  of  the  t-distribution.  So  the  main  task  here  is  to  approximate  the  second  term 
Po{r  <  m,  A (Sm/m)  <  c/m}.  Using  the  backward  method,  the  procedure  of  approximating 
P0{r  ^  m.  A(Sm/m)  <  e/m}  is  no  more  complicated  than  that  of  Pq{t  <  m}.  The  only 
difference  is  the  range  of  integration  is  changed.  To  determine  the  range  of  integration,  let  us 
recall  that  in  obtaining  (5.8),  we  made  the  following  change  of  variable  log  1  +  62  =  -  log[l  - 
(z/l-U o)2l>  where*  =  &£'1/2,  *  and  tfo  satisfying  (l-tf0)log{[l- (*(l -^o)”1]-1}  =  2ao- 
Now 


{£  :  A(£)  <  c/m}  {z  :  (1/2)  log[(l  -  z2)  ']  <  c/m}  { z  :  \z\  <  0f(l  +  8\)  J} 
where  82  satisfies  c/m  =  (1/2)  log(l  +  8\).  Clearly 

1  ~  t(o  =  2a0[log(l  +  02))-1  =  log(l  +  02)[log(l  +  02)]'1 


8 1  satisfies  a/m  =  (1/2)  log(l  +  02).  Substituting  z  =  6\{l  +  0|)-1  and  1  -  tfo  =  log(l  +  9 J )  • 
[log(l  +  02)]-1  into  log(l  +  92)  =  log{[l  -  z2{  1  -  t{0)~2)-1}  we  find  the  lower  bound  of  the 
range  of  integration  0  satisfies 

$2 

1  +  8 2 

Of  course  we  take  only  the  root  $  >  0  of  (5.11).  Note  that  when  $2  —  81,  9  =  0i.  The  upper 
bound  of  the  range  of  integration  remains  the  same.  We  have 


log(l  +  8\) 


log(l  +  6 2) 


n 

1  +  62' 


(5.11) 


Po{mo  <  r  <  m,  A(Sm/m)  <  c/m}  ~  2(o/x)l^2e-0  J_  [logfl  +  Q2)\~ll2v+{8)d9.  (5.12) 


The  sharp-eyed  reader  may  discover  the  possibility  of  obtaining  (5.12)  by  modifying 
Siegmund’s  method,  mentioned  in  the  last  paragraph  of  page  32,  along  the  same  line  of 
arguments  on  pages  11-13.  This  possibiity  does  not  exist  because  there  exists  a  deep-buried 
measurability  problem.  To  make  a  long  story  short,  the  two  dimensional  sufficient  process 
(2Tr=i  2Tr=j  ^i)  **  not  measurable  with  respect  to  the  o-ficld  generated  by  the  maximum 
invariant  process  (X^XjyX^Xz,  ■  •  • ,  XJ"1^). 
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so  Y  has  density 

fr(v)  =  2(1  +  02)0"2  f)e-1/2#J(241)^(j!)-l(r(y  +  l/2)]-l2-t>+W 
i- o 

•  exp{(l  -t-  02)  •  [y  -  1/2  -  (1/2)  log(l  +  02)]<T2} 

•{0(1  +  0)  *[l  +  log(l  +  02)  -  2yp  l^2  •  l(-oo,l/2+(l/2)loc(l+#*))(y)- 
After  some  simple  algebra  we  find  that 

/y(y)  =  e~l  exp{(l  +  02)y  -  (202)-l[2  +  02  +  log(l  +  02)]}  •  £(0-4  +  0-2)' 

/=o 

•  [T(j  +  1/2)J— 1  •  2~i  •  (Jr1  •  {1/2  +  (1/2)  log(l  +  02)  - 
-Z  has  density 

OO 

f~z(v)  =  2  •  <T2  £ exp(— (202)“*(1  +  02))((202)-I(l  +  02)K(j!)-1[r(j  +  1/2)]"1  •  2 ‘O'*1/2* 

i=o 

■  exp{0-2[y  -  (1/2)  -  (1/2)  log(l  +  02)]}  •  {tf“2[l  +  Iog(l  +  02)  -  2 y]}'"1/2 

*  OO  4/2H+lot(l+*»)])(v) 

-  f-1  exp{0-2(y  -  1  -  02/2  -  (1/2)  Iog(l  +  02)]}  •  f  ](8~*  +  0~*)>  .  (>!)-* 

j-0 

■  [T(j  +  1/2)|-'J-'((1/2)(I  +  los(l  +  »’)]  - 

The  likelihood  ratio  of  Y  with  respect  to  —Z  is  surprisingly  simple  /y(*)//_^(z)  =  e*. 
Now  applying  Theorem  2  of  Chapter  2  we  have 


i/+(0)  =  J /-(6)pZ/fiY- 


(5.9) 


Simple  algebra  shows  that  pz  =  EZ  -  62  -  (l/2)log(l  +  05),  pY  =  PT-  =  (l/2)log(l  +  0J). 
Now  by  (5.9)  it  is  clear  that  (5.8)  and  (5.1)  agree.  Next  we  consider  the  modified  repeated 
t-test.  The  stopping  rule  is  still  r,  but  we  add  the  set  {roA(5m/rn)  >  e}  to  the  rejection 
region,  that  is,  we  reject  the  null  hypothesis  when  either  r<mor{r<m,  mA(Sm/m)  >  c), 
where  0  <  a.  The  significance  elvel  of  this  test  is 

Pu{r  <  rn}  +  P0{r  >  m,  A (Sm/m)  >  e/m }  =  P0{(Sm/m)  >  e/m} 


+  Pi{r  <  rn,  A [SM/m)  <  e/m). 


(5.10) 
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~z  =  (m  -  ~  *2)"1/2]  =  («  -  l)l/2(l  -  ^2)-J/2  =  (m  -  l)^[6/(e,  -  )]3/’ 

and  fj^2  •  [fj  -  £j(l  -  te0)_2]~s/2  =  exp[3ao(l  -  t(0)-1].  Substituting  these  results  into  the 
integral  above  and  using  Stirling’s  formula  on  the  gamma  functions  we  have 

e_a(m/ 2x)l/2  •  f  exp[3a0(l  -  tJo)_1](l  -  t^)~ll2v-\ia(z)\ix.  (5.6) 

J  [0<t(o<l-mo/m) 

We  need  to  make  another  change  of  variable.  Let 

log(l  +  $*)  =  -  log[l  -  (z/1  -  i{o)2]  =  2a0(l  -  t(o)~l.  (5.7) 

Observe  that  t(0  —  0  implies  6  =  (e2a/m  —  l)1/*  and  t(0  =  1  —  m0/m  =>  6  =  (e2o/m0  _  i)i/2j 
2j  —  (1  —  tf„)(l  +  ^)"S/2[log(l  +  62)  —  202][log(l  +  02)]-1.  Substituting  these  results  into 
(5.4)  gives 

PoWo  <  r  <  m}  ~  2 (a/r)l/2e"“  [  (log(l  +  02)]~1/2 

Jti  (5.8) 

i/_[*o(*))[2*2  -  log(i  +  *2)]  •  [iog(i  +  e7)\~ldd 

where  [^o(^)J  =  lim^a,  U{e“(Vr*~*)}  where  V„  =  £"=l  Z,-,  Zt ,  Z2,-*>  is  an  i.i.d.  sequence 
of  random  variables,  each  Z{  has  the  same  distribution  as 

Z  =  (02/ 2)X2  -  6(1  +  62)l'2X  +  (l/2)[*2  -  log(  1  +  02)],  X  ~  7/(0, 1). 

To  show  that  (5.6)  agrees  with  (5.1)  we  write  Y,  Z  in  the  following  form 

Y  =  — (1/2)02(1  +  62)~l(x  -  6~1)2  +  (1/2)(1  +  log(l  +  02)) 

Z  m  (62/ 2)(X  -  (1  +  62)l!26~1)2  -  (1/2)11  +  log(l  +  *2)] 

Let  xf(7)  denote  the  noncentral  x2*distributron  with  one  degree  of  freedom  and  noncen¬ 
tral  parameter  7.  It  is  easy  to  see  that  Y,  and  -Z  distributed  as  -(l/2)[02/(l  +  i?2)]x2(0-2)  + 
(1/2)(1  +  log(l  +  62))  and  ~(l/2)62x\(6~2  +  1)  +  (1/2)[1  +  log(l  +  62)\  respectively,  and  they 
have  the  same  support  (-00,  (l/2)[l  +  log(l  +  02)]),  hence  the  likelihood  ratio  01  Y  with 
respect  to  —  Z  exists.  x?(7)  ha*  density  of  the  following  form  (see  e.g.  Ferguson  (1967)) 

£ e-’/2(7/2V(j!)-1[r(y  +  1  /2))-‘2-h+»/2)  •  e^V-*/2  •  1(0.«,,(*), 
j= 0 
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x);  (b  ~  *)/(!  ~  *)]•  Straightforward  calculation  gives 


Z  =  VH{t(0ii)  •  (1,  y/bX,  bX*) 

«  (i/2)[6/d  -  <fo)]5{6  -  (6/(1  -  tfo)!2}-1*’  -  •  (1  -  *<„)]} 

•  ib  -  [Mil  ~  U,)?}-1*  +  (i/2){tf  (1  -  t(t )-2 


(5.4) 


•  [6  -  e?(  l  -  ^o)-2]}'1  -  iog{6(6  -  €?(i  -  te.)-2]-1}}- 

where  X  is  distributed  according  to  the  standard  normal.  In  order  to  obtain  the  significance 
levels  from  (5.3)  we  have  to  uncondition  (5.3)  with  respect  to  Pa{Sm  €  dm( o}.  Observe  that 
each  term  on  the  right  hand  side  of  (5.3)  is  a  function  of  z  —  hence  a  function  of 

y  =  (m-  1)1(,2z  •  (1  -  z2)-1/2.  This  reduces  the  conditional  probability  which  in  general 
is  a  function  of  two  variables  to  a  function  of  one  variable.  This  is  because  the  likelihood 
i-atio  statistic  is  invariant  under  scale  change.  It  is  easy  to  see  that  y  has  a  {-distribution  with 
(m  -  1)  degrees  of  freedom.  Now  multiplying  (5.3)  by  the  density  of  y  and  integrating  over 
the  appropriate  range  give 


«-  f  (f.  -  •  i&  -  f?(i  -  <{.r’rJ/1 

(l  -  tfo)-l/^-(eo)«mA(fe)y»-i(y)dv 


(5.5) 


where  ym- x(y)  =  r(m/2)[(m-l)jr]_1/J-(r{(»rt-I)/2)})-l-[yl/(m-l)  +  l)_m/J  is  the  density 
of  the  t-distribution  with  (m  -  1)  degrees  of  freedom. 


The  second  factor  on  the  right  hand  side  of  the  equation  above  cancels  with 


emA<«  =  «p{(m/2)log[6/(6  -  )]}  =  [b/(b  ~  =  [*2/(m  -  1)  +  l]"*'2 

and  (5.5)  reduces  to 

e-T(m/2)[(m  -  l)»|-‘/'[r{(».  -  l)/2})]->  I  (b  -  (l)’11 

[6  -  e?(l  -  t«,)-2]'s/2  •  (1  -  *{0>  /2^((o)dy 
*  e-T(m/2)[(m  -  l)x|-‘/8[r((m  -  l)/2}]~*  [ 

(b  -  e?)s/2[6  -  e?(i  -  t(«)-2)-*/2(i  -  ttl)rll2»-(b)T-dz 
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where  60  =  (e2a'm°  -  l)1/*,  =  (e2a/m  -  l)1'2,  n+(tf)  =  lim*-*  with  Un  = 

Hi=i  y«i  Vj ,  y2 ,  •  •  •  is  an  i.i.d.  sequence  of  random  variables,  each  having  the  same  distribution 
as 

y  =  -(0J/1  +  82)X2/2  +  {8/ 1  +  82)x  +  [02/(  1  +  82)  +  log(l  +  82)]/2,  (5.2) 

with  X  distributed  according  to  the  standard  normal  distribution,  and  —  inf{n,  U„  >  b}. 

The  original  forward  method  (see  Lalley  (19S3))  gives  the  same  result.  (Private  commu¬ 
nication).  Woodroofe  (1978,  1979)  contains  a  mistake  in  the  general  approximation  formula, 
so  his  results  on  repeated  T- tests  are  also  incorrect.  The  derivation  of  (5.1)  using  Siegmund’s 
method  is  simpler  than  that  of  the  original  forward  method  because  it  takes  advantage  of  the 
invariance  property  of  the  generalized  likelihood  ratio  statistic.  The  backward  method  also 
makes  use  of  this  invariance  property  in  an  implicit  way.  More  on  thi3  point  later. 

By  (3.5) 

Plm){mo  <  r  <  m}  ~  (1  -  tfo)-‘/2|?(eo)|1/2-  ,  % 

(5-3) 

I?  (6/U  -  t0),  6H~ 1  exp[-(a,T  -  A(£o)M  •  M6>  6) 

where 

-  i 0  =  (fa,fa),  a  =  a0m,  t(o  -  inf{t :  0  <  t  <  1  -  tn0/rn, 
(l-t)A(fa/(l-t),fa)*=a0} 

lp(£i,  6)  is  the  covariance  matrix  of  (W2,  W).  W  is  normally  distributed  with  first  and  second 
moment  given  by  E{W)  =  fa  E(W2)  =  fa.  Simple  calculation  shows  that  the  determinant  of 

$i»  l?(6, 6)1 »  2(6 -*?)*• 

*-(6,6)-  lim  «S!(«’,U),  i2ffl  =  (m-DA[(f-5r)/(m-T)]-fl 

m— 00  {0 

T  ~  inf{n;  (m  -  n)A[(£  -  Sn)/(m  -  n)]  >  a}. 

|m) 

By  Theorem  3  of  Chapter  4,  Rm  under  has  the  same  limiting  distribution  as  the 
excess  over  the  boundary  by  a  random  walk  with  increment  VH(t(0p)  ■  (1,  y/faX,  faX2) 
where  p,  =  (1,0,  fa)  is  the  mean  vector  of  (1,  y/fax,  fax2),  H(x,y,z)  —  (1  -z)A[(£j  -y)/{  1  - 
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An  Example:  The  Repeated  T-Test 


Assume  Xi,Xi ,  •••  are  independent  and  normally  distributed  with  unknown  mean  p 
and  variance  <rJ,  and  that  we  are  interested  in  testing  Ho  :  fi  =  0  against  Hi  :  n  £  0.  Let 
£Pi<t(-)  denote  the  log  likelihood  of  Xi.  Some  simple  algebra  shows  that  the  (generalized)  log 
likelihood  ratio  statistic  is 

(  e(2)  'i 

nA(S„/n)  «  n(<fr(S„/n)  -  MS P/n)  =  (»/2) log  |  J 

where  S„  =  (5^,5^)  -  (Etx  Xi,  EU  *!) 

4>(x 1,*2)  =  sup^^(zi,z2)  =  \[(xt  -  1)  -  Jog(z2  -  x\)\ 

MM  =  SUpfo,<r(*j)  -  [(*2  -  1)  -  log*2]/2. 

9 

The  repeated  T-test  is  defined  in  terms  of  the  following  stopping  rule  r  =  inf{n;  n  > 
mo,  nA(S„/r»)  >  a}.  The  test  rejects  the  null  hypothesis  if  and  only  if  r  <  m.  We  first 
consider  the  problem  of  approximating  the  significance  level  of  the  repeated  t-test.  Observe 
that  the  probability  Fo,*(mo  <  r  <  m)  is  independent  of  a.  So  we  may  write  Po{mo  <  r  <  m} 
for  significance  levels. 

A  variant  of  the  forward  method  (see  Siegmund  (1985)  for  details)  which  involves  taking 
the  likelihood  ratio  of  the  maximum  invariant  process  (y2,  •  ■  ■ ,  yn)  =  (zj-1^,  z^zj,  ■  •  • , 
x"[lxn)  then  mixing  it  by  Lcbesgue  measure  over  the  invariant  paramter  space  gives  us 

Po{m0  <  r  <  m}  ~  2 (a/T)l/V  /#°[log(l  +  f2)] ~l'*v+{$)d9 

Jti 


(5.1) 


Appendix 

Here  we  give  some  details  about  the  numerical  computation  performed  in  Chapter  5. 

To  approximate  P,,{mo  <  r  <  m,A(Sm/m)  <  e/m}  for  rj  0  we  use  (5.20).  For 
Po{mo{<  t  <  m,A.(Sm/m)  <  e/m}  with  e  <  a  we  invoke  (5.10).  To  approximate  the 
significance  levels  of  repeated  the  significance  levels  of  the  repeated  significance  test  Po{mo  < 
r  <  to}  we  use 

P0{m0  <  r  <  m)  =  Po{mo  <  r  <  m,  A(Sm/m)  <  a/m}  +  Po{A(Sm/m)  >  a}.  (A.l) 

Replacing  e  by  a  in  formula  (5.10)  gives  the  approximation  of  the  first  term  on  the  right  hand 
side  of  (A.l).  The  second  term  is  easily  computed  by  calculating  the  tail  probability  of  the 
t-distribution. 

To  complete  the  approximations  above,  we  need  to  compute  v+(8)  (the  excess  over  the 
boundary  by  the  forward  process)  numerically.  The  following  proposition  is  useful. 

Propositon  1:  Let  Yi.Yj,'"  be  independent  and  identically  distributed  nonarithmetic 
random  variables  with  a  finite  positive  mean  p  >  0.  For  b  >  0  define  S„  £"=1  Y;  and 
r*  =  inf{n,  Sn  >  6}.  Then 

lim  P{exp[-a(Sr,  -  6)]}  =  exp{jr-1  [  /  a*(aa  +  t2)-1t~1[7^(«)  -  jt/2 \dt 
*—oo  Vo 

-f  a(a2  +  t2)-l(P5(t)  +  log/it)dt]} 

Jo 


(A.  2) 


where  RS  and  IS  are  the  real  and  imaginary  part  of  £(t)  =  log[l/(l-/(t))]  with  f(t)  =  Ee,IYl, 
the  characteristic  function  of  Yi- 

Proof:  See  Woodroofe  (1979). 

Now  it  is  sufficient  to  identify  /(t)  for  Yj  given  by 

Y,  =  -(9* /l  +  9i)Xi/2  +  (9/1  +  9*)X  +  [#J/1  +  9*  +  log(l  +  9*)}/  2 

with  X  ~  N( 0, 1). 


Some  straightforward  algebra  shows  that 

f{t)  =  (1  +  $2)l'2(l  +  $2  +  i^a)-‘/2exp{(tt/2)llog(l  +  62)  +  62/{  1  +  «2)]} 

•  exp{(-l/2)*2*2  •  (1  +  02)-‘(l  +  82  +  it*2)'1} 

=  ex p{(l/2) tog{(l  +  e2)[(l  +  $2)2  +  t2fl4]-1/l>  -  (t*tf2/2)[(l  +  d2)2  + t2*4]'1} 

•  exp{(t72)[ttf2(l  +  62)-1  -  tan-1[ttf2(l  +  02)"1]  +  ts04[(l  +  62)2  +  t20*)-1 

(i+^2r,+tiog(i+tf2)j} 

so 

|1  -  f(t)\2  =  1  +  exp{iog{(i  +  02)[(1  +  e2)2  +  t2o*\~1/2)  -  t2o2{{  1  +  e2)2  + t2*4]-1} 

-  2exp{(l/2)  log{(l  +  02)[(1  +  e2)2  +  t2*4]'1/2}  -  (t202/2)[(l  +  e2)2  +  t2*4]"1} 

cos{(l/2){t02(l  +  tf2)"1  -  tan_l[t02(l  +  02)"1 ]  +  ts04[(l  +  02)2  +  t204]"1 

■  (i  +  e2rl  +  t\og(i  +  02))). 

It  is  easy  to  see  that 

RS(t)  =  -  log  |1  -  f(t)\  (A.3) 

and 

IS(t)  =  sin-1{[|/(t)|/|l  -  /(t)||  .sin{(l/2)[ttf2(l  +  02)*1 

(-4.4) 

-  tan-l[t02(l  +  02)“x]  +  t304[(l  +  02)2  +  t204]-1  +  tlog(l  +  02)]}}. 

Now  substitute  (A.3)  and  (A.4)  into  (A.2)  and  perform  the  numerical  integration  to  obtain 
"+(*)• 
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